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T1R HETERO-OLIGOMERIC TASTE RECEPTORS, CELL LINES 
THAT EXPRESS SAID RECEPTORS, AND TASTE COMPOUNDS 

Cross Reference to Related Applications 

5 This application claims priority to U.S. Provisional Application Serial No 

60/494,071 filed on August 6, 2003, and U.S. Provisional Application Serial No 
60/552,064 filed March 9, 2004, both of which are incorporated by reference in their 
entirety. 

Background of the Invention 

10 Field of the Invention 

The present invention; in part relates to the discovery that the T1R receptors 

assemble to form functional taste receptors. Particularly, it has been discovered that 

co-expression of T1R1 and T1R3 results in a taste receptor that responds to umami 

taste stimuli, including monosodium glutamate. Also, it has been discovered that co- 
15 expression of the T1R2 and T1R3 receptors results in a taste receptor that responds to 

sweet taste stimuli including naturally occurring and artificial sweeteners. 

Also, the present invention relates to the use of hetero-oligomeric taste receptors 

comprising T1R1/T1R3 and T1R2/T1R3 in assays to identify compounds that 

respectively respond to umami taste stimuli and sweet taste stimuli. 
20 The invention also relates to chimeras and truncated versions of T1R1, T1R2, 

and T1R3, as well as chimeras of T1R1/T1R3 and T1R2/T1R3 receptors comprising 

human, rat, or human and rat subunits. 

Further, the invention relates to the construction of cell lines that stably or 

transiently co-express a combination of T1R1 and T1R3; or T1R2 and T1R3, including 
25 truncated or chimeric versions of these subunits as well as chimeric receptors 

comprising wild-type or chimeric subunits; under constitutive or inducible conditions. 
The use of these cell lines in cell-based assays to identify umami and sweet taste 

modulatory compounds is also provided, particularly high throughput screening assays 

that detect receptor activity by the use of fluorometric imaging. 
30 The invention also relates to compounds that bind to T1R1/T1R3, T1R2/T1R3 

receptors, as well as T1R1, T1R2, and T1R3 chimeric and truncated subunits and 

chimeric receptors. 
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Description of the Related Art 

The taste system provides Sensory information about the chemical composition 
of the external world.. Mammals are believed to have at least five basic taste 
5 modalities: sweet, bitter, sour, salty, and umami. See, e.g., Kawamura et al, 

Introduction to Umami: A Basic Taste (1987); Kinnamon et al, Ann. Rev. Physiol, 
54:715-31 (1992); Lindemann, Physiol Rev., 76:718-66 (1996); Stewart et al, Am: J. 
Physiol, 272:1-26(1997). Each taste modality is thought to be mediated by a distinct 
protein receptor or receptors that are expressed in taste receptor cells found on the 
10 surface of the tongue (Lindemann, Physol. Rev. 76:718-716 (1996)). The taste 

receptors that recognize bitter, sweet, and umami taste stimuli belong to the G-protein- 
coupled receptor (GPCR) superfamily (Hoon et al, Cell 96:451 (1999); Adler et al, 
Cell 100:693 (2000)). (Other taste modalities are believed to be mediated by ion 
channels.) 

15 G protein-coupled receptors mediate many other physiological functions, such 

as endocrine function, exocrine function, heart rate, lipolysis, and carbohydrate 
metabolism. The biochemical analysis and molecular cloning of a number of such 
receptors has revealed many basic principles regarding the function of these receptors. 
For example, United States Patent No. 5,691,188 describes how upon a ligand binding 

20 to a GPCR, the receptor undergoes a conformational change leading to activation of a 
heterotrimeric G protein by promoting the displacement of bound GDP by GTP on the 
surface of the Ga subunit and subsequent dissociation of the Gcc subunit from the Gp 
and Gy subunits. The free Ga subunits and G0y complexes activate downstream 
elements of a variety of signal transduction pathways. 

25 The T1R receptors were previously hypothesized to function as sweet taste 

receptors (Hoon et al, Cell 96:541-51 (1999); Kitagawa et al., Biochem Biophys Res. 
Commun. 283:236-42 (2001); Max et al., Nat. Genet. 28:58-63 (2001); Montmayeur et 
al., Nat. Neurosci: 4: 412-8 (2001); Sainz et al., J. Neurochem. 77: 896-903 (2001)), 
and Nelson et al. (2001) and Li et al (2002) have recently demonstrated that rat and 

30 human, respectively, T1R2 and T1R3 act in combination to recognize sweet taste 
stimuli. 

However, there remains in the art a need for new and improved flavoring 
agents. For example, one of the five known basic tastes is the "savory" or "umami" 
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flavor of monosodium glutamate ("MSG")- MSG is known to produce adverse 
reactions in some people, but very little progress has been made in identifying artificial 
substitutes for MSG. It is known that a few naturally occurring materials can increase 
or enhance the effectiveness of MSG as a savory flavoring agent, so that less MSG 
5 would be needed for a given flavoring application. For example the naturally occurring 
nucleotide compounds inosine monophosphate (IMP) or guanosine monophosphate 
(GMP) are known to have a multiplier effect on the savory taste of MSG, but IMP and 
GMP are very difficult and expensive to isolate and purify from natural sources, or 
synthesize, and hence have only limited practical application to most commercial needs 
10 in food or medicinal compositions. Less expensive compounds that would provide the 
flavor of MSG itself, or enhance the effectiveness of any MSG that is present could be 
of very high value. Similarly, discovery of compounds that are either new "High 
Intensity" sweeteners (i.e. they are many times sweeter than sucrose) would be of 
value. 

1 5 What is needed in the art is the identification and characterization of taste 

receptors which function as sweet and umami receptors, assays for identifying 
compounds that modulate (enhance or block) sweet and umami taste, and the 
compounds that specifically bind to these receptors. 

20 Summary of the Invention 

The present invention provides chimeric receptors comprising various 
combinations of human and rat TIRs, such as a chimeric T1R2/T1R3 receptor 
comprising a human T1R2 subunit and a rat T1R3 subunit; a chimeric T1R2/T1R3 
receptor comprising a rat T1R2 subunit and a human T1R3 subunit; a chimeric T1R2 
25 receptor subunit comprising a human extracellular domain, a rat transmembrane 
domain and a rat intracellular domain; and a chimeric T1R3 receptor subunit 
comprising a rat extracellular domain, a human transmembrane domain and a human 
intracellular domain. 

The present invention also provides compounds that specifically bind to T1R1, 
30 T1R2, T1R3, T1R1/T1R3 and T1R2/T1R3, or isolated subunits, fragments, chimeras or 
truncated versions thereof as disclosed herein. 

The present invention relates to the discovery that different combinations of 
TIRs, when co-expressed, produce functional taste receptors that respond to taste 
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stimuli. Particularly, the present invention relates to the discovery that co-expression of 
T1R2 and T1R3 results in a hetero-oligomeric taste receptor that responds to sweet 
taste stimuli. Also, the present invention relates to the discovery that the co-expression 
of T1R1 and T1R3 results in a hetero-oligomeric taste receptor that responds to umami 
5 taste stimuli such as monosodium glutamate. 

The present invention also relates to cell lines that co-express T1R1 and T1R3, 
including human or rat, or T1R2 and T1R3, including human or rat. In preferred 
embodiments these cell lines will express elevated amounts of the receptors, either 
constitutively or inducibly. These cell lines include cells that transiently or stably 

10 express T1R1 and T1R3 or T1R2 and T1R3. 

Also, the present invention provides assays, preferably high throughput 
screening assays, that utilize the T1R2/T1R3 taste receptor, or the T1R1/T1R3 receptor, 
preferably high throughput cell-based assays, to identify compounds that modulate 
sweet or umami taste. The invention also provides assays that include taste tests to 

1 5 confirm that these compounds modulate sweet or umami taste. 

The invention also relates to compounds that bind to the N-terminal 
extracellular domain of T1R2, compounds that bind to the cysteine-rich domain of 
T1R2, compounds that bind to the Transmembrane Domain of T1R2, compounds that 
bind to the Transmembrane Domain of T1R3, compounds that bind to the 

20 Transmembrane Domain of T1R2 of a truncated receptor h2TM/h3TM, and compounds 
that bind to the Transmembrane Domain of T1R3 of a truncated receptor h2TM/h3TM, 
for example. 

Brief Description of the Figures 
25 Figure 1 contains a sequence alignment of human and rat TIRs, human 

calcium-sensing receptor and rat metabotropic glutamate receptor. 

Figure 2 contains RT-PCR amplification experimental results which show 
that hTlR2 and hTlR3 are expressed in taste tissue. 

Figure 3a - 3b contain functional data (intracellular calcium responses) 
30 elicited by different sweet taste stimuli in HEK cells stably expressing G a is that are 
transiently transfected with human T1R2, T1R3 and T1R2/T1R3 at various 
concentrations of sweet taste stimuli (Figure 3a); human T1R2/T1R3 dose responses 
for several sweet taste stimuli (Figure 3b); human T1R2/T1R3 responses to sucrose in 
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the presence of gurmarin, and endogenous p2-adrenergic receptor responses to 
isoproterenol in the presence of gurmarin. Figure 3c contains the normalized response 
to different sweeteners. 

Figure 4 contains intracellular calcium responses in HEK cells stably 
5 expressing Gal5, transiently transfected with hTlR2/hTlR3, rTlR2/rTlR3, 

hTlR2/rTlR3 and rTlR2/hTlR3 in response to 350 mM sucrose, 25 mM tryptophan, 
1 5 mM aspartame, and 0.05 % monellin. 

Figure 5 contains the results of a fluorescence plate reactor based assay 
wherein HEK cells stably expressing Gal5 were transiently transfected with hTlR2 
10 and hTlR3 or hTlR3 alone and contacted with the calcium dye Fluo-4 and a sweet 
taste stimulus (12.5 mM cyclamate). 

Figure 6 contains normalized dose-response curves which show that hTlR2 
and hTlR3 function in combination as the human sweet receptor based on their dose- 
specific interaction with various sweet stimuli (trp, cyclamate, sucrose, neotame, 
1 5 asparame, saccharin and Acek). 

Figure 7 contains structural information relating to mGluRl and T1R1 
showing the key ligand binding residues are observed in these molecules. 

Figure 8a-8c contains functional data showing HEK cells which stably 
express Gal 5 that are transiently transfected with T1R1/T1R3 respond to glutamate in 
20 an intracellular calcium-based assay. Figure 8a shows that intracellular calcium 
increases in response to increasing glutamate concentration; Figure 8b shows 
intracellular calcium responds to IMP (2 mM), glutamate (0.5 mM) and 0.2 mM IMP; 
and Figure 8c shows human T1R1/T1R3 responses for glutamate in the presence and 
absence of 0.2 mM IMP. 
25 Figures 9a-9b respectively contain the results of an immunofluorescence 

staining assay using Myc-tagged hTlR2 and a FACS experiment showing that the 
incorporation of the PDZEP peptide (SEQ ID No: 1) enhanced the expression of a T1R 
(hTlR2) on the plasma membrane. 

Figure 10a through 10b contain calcium imaging data demonstrating that 
30 hlTR2/hTlR3 respond to different sweet stimuli. 

Figure 1 1 shows the responses of cell lines which stably express hTlRl/hTlR3 
by automated fluorescence imaging to umami taste stimuli. 
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Figure 12 shows the responses of a cell line which stably expresses 
hTlR2/hTlR3 by automated fluorescence imaging to sweet taste stimuli. 

Figure 13 shows dose-response curves determined using automated 
fluorescence imaging for a cell line that inducibly expresses the human T1R1/T1R3 
5 taste receptor for L-glutamate in the presence and absence of 0.2mM IMP. 

Figures 14 and 15 show the response of a cell line that inducibly expresses the 
human T1R1/T1R3 taste receptor (1-17 clone) to a panel of L-amino acids. In Figure 
14 different C-amino acids at 1 OmM were tested in the presence and absence of 1 mM 
IMP. In Figure 15 dose-responses for active amino acids were determined in the 
10 presence of 0.2mM IMP. 

Figure 16 shows that lactisole inhibits the receptor activities of human 
T1R2/T1R3 and human T1R1/T1R3. 

Figure 17 shows schematics of human-rat T1R chimeras. The chimeras are 
constructed by fusing the human or rat extracellular domains to the rat or human 
1 5 transmembrane domains respectively, as shown in h2-r2, r2-h2, h3-r3 and r3-h3'. 

Figure 18 shows neohesperidin dihydrochalcone (NHDC) enhances the 
activities of T1R1/T1R3 umami taste receptor. [Neohesperidin dihydrochalcone] = 5 
^M. The glutamate dose response curve is left-shifted by 2.3 fold (left panel), and the 
glutamate/IMP dose response is left-shifted by 2. 1 fold. 
20 Figure 19 shows that control sweeteners do not affect the activities of 

T1R1/T1R3 umami taste receptor [Steviocide] = 0.5 mM. [Saccharin] = 1 mM. 
Glutamate dose response is shown in the left panel, and glutamate/IMP dose response is 
shown in the right panel. 

Figure 20 shows NHDC maps to the transmembrane domain of human T1R3. 
25 Figure 21 shows mapping of a compound to the human T1R2 transmembrane 

domain. 

Figures 22a-d show sweeteners which map to different domains/subunits of the 
human sweet receptor. Figure 22a shows responses of human and rat sweet receptors to 
sucrose (200 mM), aspartame (10 mM), neotame (0.1 mM), cyclamate (10 mM), and 
30 sucrose (200 mM) in the presence of lactisole (1 mM) (Suc/Lac). HEK-293T cells were 
transiently transfected with human or rat T1R2, T1R3, and a G a \s chimera Gaisnu and 
assayed for intracellular calcium increases in response to sweeteners. Figure 22b shows 
aspartame and neotame were mapped to N- teiminal extracellular domain of human 
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T1R2. Combinations of T1R chimeras were transiently transfected into HEK-293T cells 
with Gais/ii, and assayed for responses to sweeteners at the concentrations listed in 23a. 
The presence or absence of response is what is important. Figure 22c shows cyclamate 
was mapped to the C-terminal transmembrane domain of human T1R3. Figure 22d 
5 shows lactisole was mapped to the transmembrane domain of human T1R3. Different 
combinations of T1R chimeras were transiently transfected into HEK-293T cells with G 
G ais/ii, and assayed for responses to sucrose (200 mM) arid AceK (10 mM) in the 
absence or presence of lactisole (1 mM). The activities in B, C and D represent the 
mean ± SE of number of responding cells for four imaged field of -1,000 confluent 
10 cells. 

Figures 23a-d show mutations in T1R2 or T1R3 selectively affect the activity of 

i 

different sweeteners. Figure 23a shows sequence alignment of the N-terminal ligand 
binding domain of rat mGluRS with human and rodent TlR2s. The 8 critical amino 
acids involved in ligand-binding in mGluRS are labeled with *, three of the 8 amino 

15 acids are conserved in T1R2 and underlined. Figure 23b shows two point mutations in 
the human T1R2 N-terminal extracellular domain that abolish response to aspartame 
and neotame without affecting cyclamate. Stable cell lines of hTlR2/hTlR3 (WT), 
hTlR2 S144A/hTlR3 (S144A) and hTlR2 E302A/hTlR3 (E302A) were generated as 
describe in the Examples. The dose-responses of these stable lines were determined on 

20 FLIPR for sucrose, aspartame, neotame and cyclamate. The activities represent the 

mean ± SE of fold increases in fluorescence intensities for four recorded wells. Figure 
23c shows sequence alignment of human and rodent T1R3 transmembrane domains. 
The three extracellular loops are underlined and labeled ELI, 2, or 3, according to 
their order in the protein sequences. Figure 23d shows mutations in the extracellular 

25 loop of hTlR3 that abolish response to cyclamate without affecting aspartame. Each 
of the three extracellular loops of hTlR3 were replaced with rat protein sequence 
separately, and the resulting hTlR3 mutants were transiently transfected into HEK- 
293T cells together with G a i5/n, and assayed for responses to sucrose (200mM), 
aspartame (10 mM) and cyclamate (10 mM). The activities represent the mean ± SE of 

30 number of responding cells for four imaged field of -1,000 confluent cells. 

Figures 24a-b show human T1R2 is required for G 0 i5 -coupling. Figure 24a 
shows responses of human, rat and chimeric sweet receptors to sucrose (200 mM) . 
and AceK (10 mM). Stable G a \s cells were transiently transfected with human, rat or 
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chimeric TIRs, and assayed for intracellular calcium increases in response to 
sweeteners. Figure 24b shows G ais-couplirig is mediated by human T1R2, The 
activities represent the mean ± SE of number of responding cells for four imaged 
field of -1,000 confluent cells. 
5 Figures 25a-f show the effect of lactisole and cyclamate on the human 

T1R1/T1R3 umami receptor. Figure 25a shows the response of human T1R1/T1R3 
stable cell line to L-glutamate (5 mM) and L- glutamate/IMP (1/0.2 mM) in the absence 
and presence of lactisole (5 mM). Figure 25b shows the lactisole dose-dependent 
inhibition curves were determined for L-glutamate (Glu), and L- glutamate with 0.2 
1 0 mM IMP (Glu/IMP), each at two different concentrations. The IC50s are 0. 19 ± 0.02 
mM and 0.21 ± 0.01 mM for L-glutamate at 8 and 80 mM; 0.35 ± 0.03 mM and 0.82 ± 
0.06 mM for L-glutamate with IMP at 0.8 and 8 mM respectively. Figure 25c shows the 
dose responses for L-glutamate, with or without 0.2 mM IMP, were determined in the 
presence of different concentrations of lactisole. In the presence of 0, 25, or 50-fiM 
1 5 lactisole, the EC50s are 9.9 ± 1 .5 mM, 7.9 ± 0.5 mM, and 7.0 ± 0.3 mM for L- 

glutamate; in the presence of 0, 100, or 200 \xM lactisole, the ECsos are 0.53 ± 0.04 
mM, 0.71 ± 0.10 mM, and 0.84 ±0.10 mM for L-glutamate with IMP. Values represent 
the mean ± SE for four independent responses. Figure 25d shows the detection 
thresholds for sweet, umami, and salty taste stimuli were determined in the presence or 
20 absence of lactisole. The inhibition effect of lactisole is shown as fold increases in 
detection thresholds. "Detection thresholds" are defined as the lower limit of detectable 
tastants. The detection threshold values were averaged over four trials for three subjects. 
Figure 25e shows the responses of human T1R1/T1R3 stable cell line to threshold level 
of L-glutamate (4 mM) and endogenous M2 receptor agonist carbachol were assayed on 
25 FLIPR in the absence and presence of various concentrations of cyclamate. Figure 25f 
shows the dose-responses of the human T1R1/T1R3 stable cell line were determined on 
FLIPR for L-glutamate with or without 0.2 mM IMP in the absence and presence of 
cyclamate (8 mM). The activities in B, C, E and F represent the mean ± SE of fold 
increases in fluorescence intensities for four recorded wells. The dose- responses in B, 
30 C, E and F were reproduced at least 6 times independently. 

Figure 26 shows a working model for the sweet and umami taste receptor 
structure-function relationships. Filled arrows indicate direct activation, open arrows 
indicate enhancement, and bar heads indicate inhibition. 
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Figure 27a shows all 16 combinations of TIRs and chimeras that were tested 
for responses to sweeteners and lactisole. rTlR2/TlR3H-R, iTlR2/hTlR3, and T1R2H- 
R/T1R3R-H show a significant response to cyclamate and they can be inhibited by 
lactisole. T1R chimeras were transiently transfected into HEK-293T cells with G a i5/n. 
5 The activities represent the mean ± SE of number of responding cells for four imaged 
field of -1,000 confluent cells, each unit on the Y axis represents 50 responding cells. 
Abbreviations: Sue (sucrose lOOmM); Suc/Lac (sucrose lOOmM, lactisole 1 mM); 
AceK (acesulfame K 10 mM); AceK/Lac (acesulfame K 10 mM, lactisole 1 mM); 
ATM (aspartame lOmM); NTM (neotame 10 mM); Cyc (cyclamate 10 mM). Figure 

10 27b shows the lactisole dose-dependent inhibition curves of the human sweet receptor 
were determined for sucrose (Sue), saccharin (Sac), and D-tryptophan (D-Trp), each at 
two different concentrations. The IC50s are 19.6 ± 0.1 nM and 64.6 ± 0.3 \iM for 
sucrose at 50 mM and 120 mM; 22.6 ± 0.1 \M and 103 ± 7 \jM for saccharin at 0.1 and 
2 mM; 19.9 ± 0.2 \iM and 168 ± 9 (xM for D-tryptophan respectively. Figure 27c shows 

15 the dose responses of human sweet receptor for sucrose, D-Trp and saccharin were 

determined with different concentrations of lactisole. In the presence of 0, 10, or 20 jiM 
lactisole, the EC50S are 19.4 ± 0.9 mM, 24.7 ± 1.0 mM, and 31.3 ± 0.3 mM for sucrose; 
0.37 ± 0.02 mM, 0.60 ± 0.03 mM, 0.94 ± 0.08 mM for D-Trp; 42 ± 3 ^M, 67 ± 6 (iM, 
1 1 8 ± 2 \xM for saccharin. Values represent the mean ± SE for four independent 

20 responses. The dose-responses in B and C were determined at least 6 times 
independently, and generated similar results as shown here. 

Detailed Description of the Invention 

The inventions provides compounds that specifically bind to the wild-type and 
25 chimeric sweet and umami taste receptors disclosed herein. Further provided are 

compounds that specifically bind to the wild-type, chimeric or truncated T1R2 or T1R3 
subunits of the sweet and umami receptors. 

Binding to the T1R2/T1R3 sweet receptor defines a large genus of molecules. 
The receptor responds to every sweetener tested, including carbohydrate sugars, amino 
30 acids and derivatives, sweet proteins, and synthetic sweeteners. In the meantime, the 
receptor exhibits stereo-selectivity for certain sweeteners, for example, it responds to D- 
tryptophan but not L-tryptophan, which is in correlation with taste physiology data. 
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Thus, the compounds of the invention specifically bind chimeric receptors. 
Examples include, but are not limited to, a chimeric T1R2/T1R3 receptor comprising a 
human T1R2 subunit and a rat T1R3 subunit; a chimeric T1R2/T1R3 receptor 
comprising a rat T1R2 subunit and a human T1R3 subunit; a chimeric T1R2 receptor 
5 subunit comprising a human extracellular domain, a rat transmembrane domain and a 
rat intracellular domain; and a chimeric T1R3 receptor subunit comprising a rat 
extracellular domain, a human transmembrane domain and a human intracellular 
domain. The invention provides functional taste receptors, preferably human taste 
receptors, that are produced by co-expression of a combination of different TIRs, 
1 0 preferably T1R1/T1R3 or T1R2/T1R3, and the corresponding isolated nucleic acid 
sequences or fragments, chimeras, or variants thereof that upon co-expression result in 
a functional taste receptor, i.e., a sweet taste receptor (T1R2/T1R3) or umami taste 
receptor (T1R1/T1R3). 

TIRs, a family of class C G protein-coupled receptors (GPCRs), are selectively 
15 expressed in the taste tissue (Hoon, M.A., et al., Cell, 1999. 96(4): p. 541-51, 

Bachmanov, A.A., et al., Chem Senses, 2001. 26(7): p. 925-33, Montmayeur, J.P., et al., 
Nat Neurosci, 2001. 4(5): p. 492-8, Max, M., et al., Nat Genet, 2001. 28(1): p. 58-63, 
Kitagawa, M., et al., Biochem Biophys Res Commun, 2001. 283(1): p. 236-42 and 
Nelson, G., et al., Cell, 2001. 106(3): p. 381-90.) Functional expression of TIRs in 
20 HEK293 cells revealed that different combinations of TIRs respond to sweet and 

umami taste stimuli (Nelson, G., et al., Cell, 2001. 106(3): p. 381-90, Li, X., et al., Proc 
Natl Acad Sci USA, 2002. 99(7): p. 4692-6.) T1R2 and T1R3, when co-expressed in 
293 cells, recognize diverse natural and synthetic sweeteners [For the reason mentioned 
above re "diverse", please consider whether we need this section for enablement. If not, 
25 I'd delete. We can discuss], while T1R1 and T1R3 recognize umami taste stimulus L- 
glutamate, and this response is enhanced by S'-ribonucleotides, a hallmark of umami 
taste. Knockout data confirmed that TIRs indeed mediate mouse sweet and umami 
tastes (Damak, S., et al., Science, 2003 301(5634): p. 850-3, Zhao, G.Q., et al, Cell 
2003Oct31;115(3):255-66). 
30 The class C GPCRs possess a large N-terminal extracellular domain, often 

referred to as the Venus flytrap domain (VFD) (Pin; IP., Pharmacol Ther, 2003 98(3): 
p. 325-54), and are known to function as either homodimers, in the cases of 
metabotropic glutamate receptors (mGluRs) and calcium- sensing receptor (CaR), or 
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heterodimers, in the case of y-aminobutyric acid type B receptor (GABAbR). The 
functional expression data shows a heterodimer mechanism for TIRs: both T1R1 and 
T1R2 need to be coexpressed with T1R3 to be functional, which is supported by the 
overlapping expression patterns of TIRs in rodent tongue. 
5 It is established herein that T1R family members act in combination with other 

T1R family members to function as sweet and umami taste receptors. As disclosed in 
further detail infra in the experimental examples, it has been demonstrated that 
heterologous cells which co-express hTlR2 and hTlR3 are selectively activated by 
sweet taste stimuli in a manner that mirrors human sweet taste. 
10 For example, HEK-293-Gal5 cells that co-express hTlR2 and hTlR3 

specifically respond to cyclamate, sucrose, aspartame, and saccharin, and the dose 
responses for these compounds correlate with the psychophysical taste detection 
thresholds. 

Also, as supported by data in the experimental examples, it has been shown that 

15 cells which co-express hTlRl and hTlR3 are selectively activated by glutamate 
(monosodium glutamate) and 5 '-ribonucleotides in a manner that mirrors human 
umami taste. For example, HEK-293-Gal5 cells that co-express hTlRl and hTlR3 
specifically respond to glutamate and the dose response for this umami-tasting 
compound correlates with its psychophysical taste detection threshold. Moreover, 5'- 

20 ribonucleotides such as IMP enhance the glutamate response of the T1R1/T1R3 
receptor, a synergism characteristic of umami taste. 

Further, as shown by experimental data in the examples it has been shown that 
cells which stably and inducibly co-express T1R1/T1R3 selectively respond to the 
umami taste stimuli L-glutamate and L-aspartate and only weakly respond to other L- 

25 amino acids, and at much higher concentrations, providing further evidence that the 
T1R1/T1R3 receptor can be used in assays to identify compounds that modulate 
(enhance or block) umami taste stimuli. 

Examples of compounds that specifically bind to the sweet receptor and 
modulate sweet taste can be found in Table 5. 

30 Tables 1-4 provide examples of compounds that specifically bind to the umami 

receptor and modulate umami taste. 



11 



WO 2005/015158 



PCT/US2004/025459 









Table 1 


- Umami Amides 


Compound 
No. 


Compound 


Umami 
EC s „(uM) 


£cso ratio (vs. 
MSG) 


@ 
(uM) 


Al 


3,6-Dichloro-N-(4-emoxy-pheiiyl)-2- 
methoxy-benzamide 


0.22 


2.74 


1 


A2 


4-(3,6-Dichloro-2-metho 
benzoylamino)-benzoic acid 
ester 


xy- 

methyl 


0.93 


6.98 


0.01 


A3 c 


ethoxyphenyl)benzamide 


1.08 


6.14 


0.03 


A4 


2-[(Benzo[b]thiophene-2-car 
amino]-4-methyl-pentanoic 
methyl ester 


...iiiiih 

y 

bonyl)- 
;acid 


0.4 






A5 


2-[(Benzofuran-2-carbonyl)-ai 
methyl-pentanoic acid meth) 


J 

••IIIIIH 

r 

nino]-4- 
A ester 


0.31 






A6 


2-[(5-Methoxy-benzofurai 
carbonyl)-amino]-4-methyl-pc 
acid methyl ester 


X 

1 

1-2- 
ntanoic 


032 


2.86 


1 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A7 




0.46 



(R)-5-methoxy-N-( 1 -methoxy-4- 
methylpentan-2-yl)benzofuran-2- 
carboxamide 



A8 




0.5 



5 -methyl-N-(5 -methy lhexan-3 -yl) 
benzofliran-2-carboxamide 



A9 




0.71 



2-[(Benzofuran-5-carbonyl)-amino]-4- 
methyl-pentanoic acid methyl esters- 
methyl 2-(benzofuran-5-carboxarnido)- 
4-methylpentanoate 



A10 




0.91 



N-(heptan-4-yl)-5-methoxybenzofiiran- 
2-carboxamide 



4.51 



All 




1.05 



5-chloro-N-(l-methoxybutan-2- 
yl)benzofUran-2-carboxamide 



6.5 



0.3 



A12 




1.13 



5 -methoxy-N-(2-methylhexan-3 - 
yl)benzofuran-2-carboxarnide 
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Table 1 - Uraami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A13 




1.14 



5-methoxy-N-(pentan-3-yl)benzofuran- 
2-carboxamide 



4.46 



A14 




1.14 



2-[(5-Methoxy-benzoforan-2- 
carbonyl)-amino]-4-methylsulfanyl- 
butyric acid methyl ester methyl 2-(5- 
methoxybenzofiiran-2-carboxamido)-4- 
(methylthio)butanoate 



A15 




1.14 



(lR,2R)-ethyl2-(5- 
methoxybenzofuran-2- 
carboxamido)cyclohexanecarboxylate 



A16 




1.18 



5-methoxy-N-(2-methylpentan-3- 
yl)benzofuran-2-carboxamide 



A17 




1.2 



✓ - o 
N^2,4-dimethylpentan-3-yl)-5- 
methoxybenzofuran-2-carboxamide 



A18 




1.27 



5 -methoxy-N-(2 - methylheptan-4- 
yl)benzofuran-2-carboxamide 
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Table 1 - 


Umaml Amides 


Compound 
No. 


Compound 


Umami 
EC 50 (uM) 


Ec so ratio (vs. 
MSG) 


@ 
(uM) 


A19 


5-methoxy-N-( 1 -methoxypentan-2- 
yl)benzofuran-2-carboxamide 


1.3 






A20 


0 






1.32 








5-methyl-N-(2-methylheptan-4-yl) 
benzofuran-2-carboxamide 








A21 


N-(pentan-3-yl)benzofuran-2- 
carboxamide 


1.52 


3.74 


1 


A22 


ca 

( 






1.58 








Benzothiazole-6-carboxylic acid (1- 
propyl-butyl)-amide 








A23 




w 




0.38 








2-methyl-N-{2-inethylheptan-4- 
yl)benzo[d]oxazole-5-carboxamide 








A24 


2-methyl-N-(2-methylheptan-4- 
yl)benzo[dloxazole-6-carboxamide 


1.12 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
EC50 (uM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 


A25 


(R)-4-Methyl-2-[(2-methyl- 
benzooxazole-6-carbonyl)-amino]- 
pentanoic acid methyl ester 


1.48 






A26 


2-methyl-N-(2-methylhexan-3- 
yl)berjzo[d]oxazole-6-carboxamide 


1.6 




• 


All 


2-ethyl-N-(heptan-4-yl)benzo[d] 
oxazole-6-carboxamide 


1.61 






A28 


(R)- 4-Methyl-2-[(2-memyl- 
benzooxazole-5-carbonyl)-amino]- 
pentanoic acid methyl ester 


1.69 






A29 


N-(heptan-4-yl)benzo [d] 
oxazole-6-carboxamide 


1.91 






A30 


Bf \^0 HN 1 

5-bn>mo-N-(heptan-4-yl)furan-2- 
carboxamide 


0.49 


12.6 


1 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ecso ratio (vs. 
MSG) 



@ 
fuM) 



A31 




0.62 



N-(heptan-4-yl)-4,5-dimethylfuran-2- 
carboxamide 



10.04 



A32 



A33 



A34 




1.15 



N-(2,3-dimethylcyclohexyl)-3- 
methylfuran-2-carboxamide 




1.33 



4,5-dimethyl-N«(2-methylcyclohexyl) 
naran-2-carboxamide 




0.53 



(R)-methyl 2-(lH-indole-2- 
carboxamido)-4-methylpentanoate 



A35 




0.82 



N-(heptan-4-yl)-lH-indole-6- 
carboxamide 



8.81 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ec M ratio (vs< 
MSG) 



@ 
(uM) 



A36 




1.01 



(R)-methyl 2-OH-indole-5- 
carboxamido)-4-metliylpentanoate 



A37 




1.5 



(R)-methyl 4-methyl-2-(quinoline-6- 
carboxamido)pentanoate 



A38 




1.22 



5-Methyl-thiophene-2-carboxylic acid 
(l-propyl-bu^l)-amide 



6.54 



A39 




1.31 



5-Methyl-thiophene-2-carboxylic acid 
(1,2,3,4-tetrahydro-naphthalen-l-yl)- 



2.3 



A40 




0.37 



(R)-methyl 2-(2-naphthamido)-4- 
methylpentanoate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
EC 5D (uM) 


Ecso ratio (vs. 
MSG) 


1 @ 
(uM) 


















A41 


0 


a 


r 

n 




0.7 


2.14 


3 




N-(nonan-5-yl)benzo[d][l,3]dioxole-5- 
carboxamide 








A42 


c 


c 


L 


T H v/"""" H 


0.35 








(2R,3R)-me%12- 
(benzo[d][l,3]dioxole-5-carboxamido)- 
3-methylpentanoate 








A43 


° i 




c 




0.49 








2-[(Benzo[l,3]dioxole-5-carbonyl)- 
amino]-hexanoic acid methyl ester 








































A44 










0.61 








(R)-2-[(Benzo[l,3]dioxole-5- 
carbonyl)-amino]-hexanoic acid methyl 
ester 


























0 


0 












A45 






0 




0.88 








(R)-ethyl 2-(benzo[d][l,3]dioxole-5- 
carboxamido>4-methylpentanoate 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ec 50 ratio (vs, 
MSG) 



@ 
(uM) 



A46 




1.32 



(R)-methyl 2-(2,3-dihydrobenzofuran- 
5-carboxamido)-4-metiiylpentaiioate 



A47 




1.33 



(S)-N-(l,2,3,4-tetrahydronaphthalen-l- 
y^benzofdlCl^ldioxole-S-carboxamide 



6.42 



0.1 



A48 




1.51 



N-(4-phenylbutan-2-yl)benzo[d] 
[1 ,3]dioxole-5-carboxamide 



9.27 



A49 




1.54 



2-[(Benzo[l,3]dioxole-5-carbonyl)- 
amino]-pentanoic acid methyl ester 



9.53 



A50 




1.57 



•v T . 

N-(benzo[d][l,3]dioxol-5-yl)-2- 
. propylpentanamide 
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Table 1 - Uraarai Amides 


Compound 
No, 


Compound 


Umami 
EC 5D (uM) 


Ec 50 ratio (vs. 
MSG) 


® 
(uM) 
















A51 








1.58 








(R)-propyl 2-(benzo[d][l,3]dioxole-5- 
carboxamido)-4-methylpentanoate 






















A52 




c 




1.65 








N-(heptan-4-yl)-2,3- 
dihydrobenzofuran-5-carboxamide 






















A53 


CO 


c 




1.83 








N-(hexan-3-yl)benzo[d][l,3] 
dioxole-5-carboxamide 














^ ^ 








A54 








0.12 








N-(hexan-3-yl)-3-methyM- 
(methylthio)benzamide 






















A55 




— i 




0.12 








ct 

methyl 2-(3-chloro-4- 
methoxybenzamido)hexanoate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ec 50 ratio (vs. 
MSG) 


@ 
(uM) 


A56 


N-(hexan-3-yl)-3,4-imethylbenzamide 


0.14 






A57 


(R)-methyl 4-methyl-2-(4- 
vinylbenzamido)pentanoate 


0.18 






A58 


4-methoxy-3-methyl-N-(2- 


0.2 






A59 


4-methoxy-3-methyl-N-(2- 
methylhexan-3-yl)benzamide 


0.2 






A60 


(R)-methyl2-(4- 
(ethylthio)benzamido)-4 
methylpentanoate 


H 


0.2 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A61 




0.22 



N-(heptan-4-yl)-4-methoxy-3- 
methylbenzaraide 



A62 




0.25 



(R)-methyl2-(3,4- 
dimethylbenzamido)^3- 
methylbutanoate 



A63 




0.25 



(R)-methyI 2-(4-methoxy-3- 
methylbenzamido)-4"methylpentanoate 



A64 




0.26 



4-ethoxy-3-methyl-N-(pentan-3- 
yl)benzamide 



A65 




0.29 



(R)-N-( 1 -methoxy~4-methylpentan-2- 
yl)-3-methyl-4-(methylthio)benzamide 
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Table 1 - Umarai Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ecso ratio (vs. 
MSG) 


@ 1 
(uM) 






7 








A66 






0.29 








N-(2,4-dimethoxybenzyl)-3-(lH- 
pyrrol- 1 -yl)isonicotinamide 
































A67 






0.29 


10.75 


1 




a 

methyl 2-(3-chloro-4- 
methoxybenzamido)pentanoate 








A68 






0.32 


2.62 


0.3 




—i w x >> . \ 

4-ethoxy-N-(heptan-4-yl)benzamide 








A69 


-cvc 




0.32 








\=/ o \ 
(R)-methyl 4-methyl-2-(4- 
methylbenzamido)pentanoate 








A70 






0.33 








N-(heptan-4-yl)-3- 
(trifluoromethyl)benzaniide 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ec 50 ratio (vs 
MSG) 



@ 
(uM) 



A71 



4-ethyl-N-(heptan-4-yl)benzamide 



0.34 



A72 




4-ethoxy-3 -methyl-N-(5-methylhexan- 
3-yl)benzamide 

~* ^T" 



0.34 



A73 




0.34 



(R)-methyl 2-(3-methoxy-4- 
methylbenzamido)-4-methylpentanoate 



A74 




0.35 



3-fluoro-N-(heptan-4-yl)-4- 
methoxybenzamide 



4.98 



0.3 



A75 




0.39 



N-(heptan-4-yl)-4- 
(methyltfaio)benzamide 



A76 




0.4 



4-methoxy -3 -methyl-N-(4- 
phenylbutan-2-yl)benzamide 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ec 50 ratio (vs. 
MSG) 



@ 
(uM) 



All 




0.44 



CI 



3-chloro-4-methoxy-N-(2- 
methylcyclohexyl)benzamide 



A78 




0.46 



N-(heptan-4-yl)-4-vinylbeii2amide 



10.22 



0.3 



A79 




0.46 



N-(heptan-4-ylH-methoxybenzamide 



A80 





0.47 



3-chloro-4-methoxy-N-(pentan-2- 
yl)ben2amide 



5.12 



0.1 



A81 




0.5 



N^exan-3-yl)-4-methyl-3- 
(methylthio)benzaniide 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso(uM) 



Ec 50 ratio (vs, 
MSG) 



@ 
(uM) 



A82 




0.51 



(R)-methyl 4-methyl-2-(4- 
propoxybenzamido)pentanoate 



A83 




0.52 



N-(heptan-4-yI)-3-metfayIbenzamide 



A84 




0.53 



N-(heptan-4-yl)-2-hydroxy-3- 
methoxybenzamide 



A85 




0.53 



(R)-methyl2-(3,5- 
dimethylbenzamido)-4- 
methylpenta noate 



A86 




0.53 



methyl 2-(4-methoxy-3~ 
methylbenzamido)-4- 
(methylthio)butanoate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
EC 50 (uM) 


Ec 50 ratio (vs. 
MSG) 


@ 
(uM) 






^ / — \ 

HN f \ 




















A87 






0.54 


3.8 


i 




2-hydroxy-3-methoxy-N-(l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzamide 




















A88 




0.55 








N-(2,4-dimethylpentan-3-yl)-3-methyl- 
4-(methylthio)benzamide 










i 










A89 






0.6 


2.85 


i 




(R)-3-chloro-4-methoxy-N-(l-(4- 
methoxyphenyl)ethyl)benzamide 




















a on 






ft 61 








N-(heptan-4-yl)-3-methoxybenzamide 
































A91 






0.62 








(R)-methyl 4-methyl-2-(4- 
propylbenzamido)pentanoate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
EC 50 (uM) 


Ecso ratio (vs. 
! MSG) 


@ 
(uM) 


A92 


1 

4-ethoxy-3-methyl-N-(2~methylheptan- 
4-yl)benzamide 


0.65 






A93 


ri ~\) 

0 OH } = \ 

VJ 

(S^-hydroxy-S-methoxy-N^l^^^- 
tetrahydronaphthalen- 1 -yl)benzamide 


0.7 


5.7 


1 


A94 


(R)-4-methoxy-N-(2-methoxy- 1 - 
phenylethyl)-3-methylbenzamide 


0.72 






A95 


(R)-methyl 2-<4-metlioxy-3,5- 
dimethylbenzamido)-4- 
methylpentanoate 


0.74 
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Table 1 - Umami Amides 



Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ecjo ratio (vs. 
MSG) 


@ 
(uM) 


A96 




4-methoxy-N-(l-(4- 
methoxyphenyl)propyl)-3- 
methylbenzamide 


0.76 






A97 


4-methoxy-N-( 1 -methoxypentan-2-y 1)- 
3-methylbenzamide 


0.85 






A98 


CI 

3-chloro-N-( 1 -hydroxy-4- 
methylpentan-2-yl)-4- 
methoxybenzamide 


0.88 






A99 


j 

i 


(R)-methyl4-methyl-2-(3- 
nethylbenzamido)pentanoate 


0.89 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
EC 50 (uM) 


Ec 5 o ratio (vs. 
MSG) 


@ 
(uM) 


A100 


i 

3-chloro-4-methoxy-N-(l-p- 
tolylethyl)benzamide 


1.1 






A101 


N-(heptan-4-yl)-2-hydroxy-4- 
methoxybenzamide 


1.16 


7.62 


1 


A102 


4-hydroxy-3-methyl-N-{ 1 ,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzanride 


1.32 


9.49 


1 | 


A103 


(lS,2R)-ethyl 2-(3-chloro-4- 
methoxybenzamido) 
cyclohexanecarboxylate 


1.36 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ec 50 ratio (vs. 
MSG) 


@ 
(uM) 












A104 


Biphenyl-2-caiboxylic acid 2,4- 
dimethoxy-benzylamide 


1.37 






A105 


(S)-N-(l ,2,3,4-tetrahydronaphthalen-l- 
yl)-4-vinyIbenzamide 


1.38 


2.79 


1 


A106 


c/ \J 

3-chloro-N-(2,3»dihydro-lH-inden-l- 
yl)-4-methoxybenzanride 


1.39 


4.01 


0.3 
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Table 2 - Umami Oxalamides 


Compound 
No. 


Compound 


Umami EC 50 
(uM) 


EC50 ratio (vs. 
MSG) 


Bl 






0.18 






1 

Nl-(2,4-dimethoxybenzyl)-N2-(2-(furan-2- 
yl)ethyl)oxalamide 






B2 


0 

Nl-(4-ethoxy-2-mefhoxybenzyl)-N2-(2-(5- 
methylpyridin-2-yl)ethyl)oxalanride 


0.19 








o 










0 ' j r 






B3 


o 




0.81 






N^3-Me%l-benzo|>]thiophen-2-ylniethyl)-N , -(2- 
pyridin-2-yl-ethyl)-oxalamide 






B4 


, O _/=\ 

V- O \— NH HN — ' H—y 

oH 

0 0 

Nl-(2-isopropoxybenzyl)-N2-(2-(pyridin-2- 
yl)ethyl)oxalamide 


1.22 
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Table 3 - Umami Ureas 


Compound 
NO. 


IUPAC Name 


Umami 
uM 


Ec50 
ratio (vs. 
MSG) 


Con. 
(uM) 


CI 


CI N — \ 
l-(2-chlorophenyl)-3-(heptan-4-yl)iirea 


0.37 


4.95 


1 


C2 


CI HN— ^ > 

0 

l-(2,4-dichloropheny^-3-(l-phenylpropyl)urea 


0.49 


4.52 


1 


C3 


l-(2-chlorophenyl)-3-(2-methylcycloliexyl)urea 


0.52 


3.24 


3 


C4 


0 x 
l-(2-fluorophenyl)-3-(heptan-4-yl)urea 


0.79 


12.15 


3 


C5 


HN Y N y^ 

H2-chlorophenyl)-3-(l-cyclohexyIethyl)urea 


0.84 


9.08 


1 


C6 


HN^Q 
0 

l-(4-isopropylphenyl)-3-(2-(pyridin-2-vnethvnurea 


0.98 






C7 


l-(2-chlorophenyl)-3-(l,2,3,4-tetrahydronaphthalen- 
l-yl)urea 


0.99 


3.68 


1 
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Table 3 - 


Umami Ureas 


Compound 
No. 


IDPAC Name 


Umami 
uM 


Ec50 
ratio (vs. 
MSG) 


Con. 
(uM) 


C8 


1 -(2,4-dimethoxyphenyl)-3-(2- 
methylcyclohexyl)urea 


1.41 


2.62 


0.3 


C9 


HN / 

\ 

1 -{2-ethylphenyl)-3-(heptan-4-yl)urea 


1.42 






CIO 




1.51 


2.1 


0.3 


Cll. 


1 -(2-fluorophenyl)-3-0 ,23.4-tetrahvdronaDhthalen- 
l-yl)urea 


1.65 


4.49 


1 


C12 


—0 HN — ^ N ' 

0 

H2-methoxyphenyl)-3-(2-methylcyclohexyl)urea 


1.67 






C13 


0^ 

l-(2,4-dimethoxyphenyl)-3-(pentan-3-yl)urea 


1.72 


11.87 


1 
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Table 4 - Urn a mi Acrvlamides 


Compound 
No. 


Compound 


Umami EC 5 o 
(uM) 


Ecso ratio 
(vs. MSG) 


@ 
(uM) 


JL/i 


1 X. 

(E)-N-(2,4-dimethylpentan-3-yl)-3-(4- 
methoxyphenyl)acryIamide 


0.29 


3.46 


1 


D2 


(R,E)-methyl 2-(3-(4-methoxyphenyl) 
acrylamido)-4-methylpentanoate 


0.32 






D3 


(E)-methyl 2-(3-(4-methoxyphenyl) 
acrylamido)hexanoate 


0.63 






D4 


N-( 1 -Methyl-3-phenyl-propyl)-3- 
thiophen-2-yl-acrylamide 


0.69 


9.73 


1 


DS 


(E)-N-(heptan-4-yl)-3-(4 
methoxyphenyl)acrylami< 


ie 


0.72 


3.48 


0.3 


D6 


. 0 s 

N-(l-Propyl-butyl)^-thiophen-2-yl- 
acrylamide 


0.75 


6.3 


1 


D7 


hkh: 

0 

(E)-3-(4-methoxyphenyl)-N- 
(pentan-3-yl)acrylamide 


0.82 


9.62 


1 
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Table 4 - Umami Acrvlamides 


Compound 
No. 


Compound 


Umami EC 50 
(uM) 


Ecso ratio 
(vs. MSG) 


® 
(uM) 












D8 


(R,E)-3-(4-ethoxyphenyl)-N<l- 
methoxy-4-methylpentan-2- 
yl)acrylamide 


0.94 






D9 


(Z)-N-(heptan-4-yl)hex-2-enamide 


0.98 


























D10 


(R,E)-methyl 4-methyl-2-{3.(thiophen- 
3-yl)acrylamido)pentanoate 


1.09 
















Dll 


(R)-methyl 2-cinnamamido-4- 
methjrlpentanoate 


1.17 








/ N v H / 








D12 


-C o 

( E)-4-methvl-N-^2-meth vie vcl ohex vH 
pent-2-enamide 


1.28 
















D13 


hls^-. 

(E)-N-sec-butyl-3-(4- 
ethoxyphenyl)acrylamide 


1.31 


2.7 


0.3 
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Table 4 - Umami Acrylamides 


Compound 


Compound 


Umami EC50 


Ecso ratio 
(vs. MM*) 


@ 
(uM) 


D14 


(E)-N-(l-methoxybutan-2-yl)-3-(4- 
methoxyphenyl)acrylamide 


1.43 


8.48 


1 


D15 


(E)-N-(heptan-4-yl)-3- 
(thiophen-3 -yl)acrylamide 


1.54 


2.22 


0.3 


D16 


(E>3-(3,4-dimethoxyphenyl)-N-(4- 
phenylbutan-2-yl)acrylamide 


1.56 


3.13 


1 
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Table 5 • Sweet EnhancerAmides 


Compound 
No. 


Compound 


Sweet 
EC 50 
uM 


Umami 
EC50 
uM 


Umami 
EC50 
ratio 












El 


CI 

3-chloro-2-hydroxy-N-(2-methyl-l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzamide 


0.19 








o 










oi-6 

i v/n 

CI 

(R)-3-chloro-2-hydroxy-N-(l,2,3,4- 
tetrahy(3ronaphthalen-l-yl)benzamide 








E2 


0.65 








/A ) 








E3 


CI OH /=\ 

3-chloro-2-hydroxy-N-(5-hydroxy-l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzamide 


1.03 


























E4 


CI OH ) ( 

\ ) 

3-chloro-2-hydroxy-N-(4-methyl-l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzamide 


1.61 








in 








E5 


Cl " UL 

l 1 

3-chloro-2-hydroxy-N-(6-methoxy-l,2,3,4- 

tetrahydronaphthalen-l-yl)benzamide 


1.61 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umarai 
EC50 
uM 


Umami 
ECso 
ratio 


E6 


3-methyl-NK2-methyl-l,2,3,44etrahydronaphthalen- 
l-yl)isoxazole-4-carboxamide 


1.48 








o 








E7 


y^oH <J 

CI 

3-chloro-2-hydroxy-N-(l,2,3,4- 
tetrahydronaph1halen-l-yl)benzamide 


1.81 




4.04 




o ^""y^^i 








E8 


OH 

2,3-dihydroxy-N-(2-methyl-l,2,3,4- 
tetrahydronaphthalen-l-yl)benzamide 


1.98 
















E9 


o ■ o 

X>H 

2-hydroxy-N-(2-methyl- 1 ,2,3,4-tetrahydronaphthalen- 
l-yl)benzamide 


2.36 






E10 


Oyer" 

OH 

2,3-dihydroxy-N-(5-methoxy-l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzanride 


2.44 
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Table 5 - Sweet Enhancer Amides 



Compound 
No. 



Compound 



Sweet 
EC50 
uM 



Umami 
ECso 
uM 



Umami 
EC50 
ratio 



Bll 




2.46 



3-methyl-N-(4-methyl-l,2,3,4-tetrahydronaphthalen- 
l-yl)isoxazole-4>carboxamide 



E12 




2.85 



N-(5-methoxy-l,2,3,4-tetrahydronaplithalen-l-yl)-3- 
methylisoxazole-4-carboxamide 



E13 




(S)-3-cWoro-2-me%l-N-(l,2,3,4- 
tetrahydronaphthalen-l-yljbenzamide 



2.91 



E14 




(S)-2,6-dimethyl-N-(l,2,3,4- 
tetrahydronaphthalen-l-yl)benzamide 



2.91 



E15 




2,6-dichloro-N-(l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzamide 



3.02 



E16 




S^KUcMoro^-methoxy-N-O^^ 
tetrahydronaphtbalen"l-yl)benzamide 



3.04 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
ECso 
uM 


Umami 
ECso 
ratio 




o j^S 










OH 

(R)-23-dihydroxy-N^l,2,3,4-tetrahydroiiaphtiialeii- 
l-yl)benzamide 








E17 


3.13 








0 p ^| 








E18 


un ^ JL JL JL O 

2,5-dihydroxy-N-(5-methoxy- 1 ,2,3,4- 
tetrahydronaphthalen-l-yl)benzamide 


3.38 








o '■f XVs | 








E19 


9- H u 

F 

(S)-3-fluoro-2-methyl-N-(l,2,3,4- 
tetrahydronaphthalen- 1 -y l)benzamide 


3.57 








N> o j^N 










W 

(S)-3-chloro-2,6-dimethoxy-N-(l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzamide 








E20 


4.13 








CI 








E21 


Br 

(R)-5-bromo-N-( 1 ,2,3,4-tetrahydronaphthalen- 1 - 
yl)nicotinamide 


4.19 
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Table 5 - Sweet EnhancerAmides 


Compound 
No. 


Compound 


Sweet 
EC 50 
uM 


Umami 
ECso 
uM 


Umami 
EC50 
ratio 


E22 


o 

CI 

(R)-3-chloro-N-(l,2,3,4- 
tetrahydronaphthalen-l-yl)benzamide 


4.52 






E23 


F 

(R)-3-fluoro-N-(l f 2,3,4-tetrahydronaphthalen-l- 
yl)benzamide 


4.86 






E24 


(R)-2,5-dihydroxy.N-(l,2,3,4- 
tetrahvdrona'Dhthalen- 1 -vlYhenzflmide 


6.04 






E25 | 


(RJ-S-methyl-N^l^^^-tetrahydronaphtiialen-l- 
yl)isoxazole-4-carboxamide 


7.79 






E26 


(R>5-niethyl-N-(l,2,3,4-tetrahydronaphflialen- 
1 -yl)isoxazole-4-carboxamide 


8.09 






E27 


F 0 1 

2,3,5,6-tetrafluoro-4-methyl-N-(3-methylbutan-2- 
yl)benzamide 


0.14 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC$o 
uM 


Umami 
EC 50 
uM 


Umami 
EC50 
ratio 




P 


















E28 


F TlT |\ 

F 0 1 

N-(3,3-dimethylbutan-2-yl)-2,3,5,6-tetrafluoro-4- 
methylbenzamide 


ft 91 


























E29 


-r 6- 

N^2-methylcyclohexyl)-3- 
(trifluoromethoxy)benzamide 


0.42 






E30 


/l ! i 

3-cMoro-5-£luoro-N^2-memylcyclohexyl)beiizamide 


0.45 
















E31 


F O 1 

(R)-N-(3,3-dimethylbutan-2-yl)-2,3,5,6-tetrafluoro-4. 
methylbenzamide 


0.49 
















E32 


4-fluoro-N-(2-methylcyclohexyl)-3- 
(trifluoromethyl)benzamide 


0.51 
















E33 


CI 

2,5-dichloro-N-(2-methylcyclohexyl)benzamide 


0.63 
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L Table 5 - Sweet Enhancer Amides ^ 


Compound 
No. 


Compound 


Sweet 
ECso 
uM 


Umami 
EC50 
uM 


Umami 
ECso 
ratio 






F 








E34 






0.71 








F O 1 

2,3,5,6-tetrafluoro-N-(hexan-2-yl)-4- 
methylbenzamide 




















E35 




0.71 










.0 0 k^J 










3,5-dichloro-2,6-dimethoxy-N-(2- 
methylcyclohexyl)benzamide 








E36 




VyyS 


0.72 








2,4,6-ti±nethyl-N^2-methylcyclohexyl)benzamide 








E37 


CI 


o Ij 


0.77 








3,6-dicWoro-2-methoxy-N- 
(2-methylcyclohexyl)benzamide 








E38 




F 


0.9 








(S)-N^3,3-dimethylbutan-2-yl)-2,3,5,6-tetrafluoro-4- 
methylbenzamide 








E39 




CI 0 I J 


0.91 








2,6-dichloro-N-(2-methylcycIohexyl)benzamide 
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Table 5 - Sweet EnhancerAmldes 


Compound 
No. 


Compound 


Sweet 
EC 50 
uM 


Umami 
EC50 
uM 


Umami 
EC50 
ratio 


E40 


CI 

n » i 

2-chloro-6-methoxy-N-(2- 
methylcyclohexyl)isonicotinamide 


0.95 




9.77 


E41 


tetrafluoro-4-methylbenzamide 


1.02 






E42 


N-(l -methoxybutan-2-yl)-2 > 4-dimethylbenzamide 


1.06 






E43 


N-(2,3-dimethylcyclohexyl)-2,3,5,6-tetrafluoro 
-4-methylbenzamide 


1.08 






E44 


O 

V- NH y 

N— ^ > ( 

2^Uoro-N-(2,3-diniethylcyclohexyl)isonicotiiianiide 


1.08 






E45 


N-cyclohexyM^jS^-tetrafluoro^methylbenzamide 


1.13 
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Table 5 - Sweet Enhancer Amides 



Compound 
No. 



Compound 



Sweet 
EC50 
uM 



Umami 
ECs 
uM 



Umami 
ECso 
ratio 



E46 




1.25 



F F 

N-cyclooctyl^^^^-tetrafluoro^-methylbenzamide 



E47 




1.25 



(R)-2,3,5,6-tetrafluoro-4-methyl-N-(3-metiiyIbutan-2- 
yl)benzamide 



E48 




1.29 



3,6-dichloro-N-(2,3-dimethylcyclohexyl)-2- 
methoxybenzamide 



E49 




1.39 



N-cycloheptyl-2,4 > 64rimethylbenzamide 



E50 




1.41 



N-(2,3-dimethylcyclohexyl)-2,4,6- 
trimethylbenzamide 




3-chloro-N-(2,3-4ihydro-lH-inden-l-yl).2 
hydroxybenzamide 
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Table 5 - Sweet EnhancerAmides 



Compound 
No. 



Compound 



Sweet 
EC50 
uM 



Uraami 
EC50 
uM 



Umami 
ECso 
ratio 



B52 




1.52 



2-methyl-N-{2-mefliylcyclohexyl)-l-naphtfaamide 



E53 




NH 



CI 

3-chloro-4-flaoro-N-(2-metlr 



1.7 



E54 



E55 



C 




NH 



CI 



1.83 



3,4-dichloro-N-(2-metfaylcyclohexyl)benzamide 



N 




NH 



Br 



1.89 



5-bromo-N-(2,3-dimethylcyclohexyl)nicotiDamide 



E56 




NH 



n — * 

2-chloro-N-(2-methylcyclohexyl)isonicotinamide 




1.92 



E57 




1.95 



CI O 

2-chloro-3-me&yl-N-(2-methylcyclohexyl)benzamide 



T F 



E58 




2.23 



F F 

N-cyclopentyl-2,3,5,6-tetrafluoro-4-methylben2amide 
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Table 5- 


Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
ECso 

IUV1 


Umami 
ECso 

nlVff 
ILLVl 


Umami 
EC50 
ratio 


E59 


\ f\ H J 

F F I 

N-(2-methylcyclohexyl)-3- 
(trifluoromethyl)benzamide 






2.34 




2.07 


E60 


4-fluoro-N-(4-methylcyclohexyl)-3- 
(trifluoromethyl)benzamide 




2.37 






E61 


\ A « 1 

f f T T 

2-fluoro-N-(2-methylcyclohexyl)-3- 
(trifluoromethyl)benzamide 




2.4 






E62 


5-bromo-N-(2 


O 

ra 

-methylcyclohexyl)nicotinamide 


2.42 






E63 


2,3-dimethyl-N-(2-methylcyclohexyl)benzamide 


2.6 






E64 


2,6-dichloro-N-(2,3-dimethylcyclohexyl)benzamide 


2.77 






E65 


--oh' 

2-fluoro-N-(2-methylcycloliexyl)isonicotinaniide 


2.83 







49 



WO 2005/015158 



PCT/US2004/025459 



Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
ECso 
uM 


Umami 
ECso 
uM 


Umami 
EC50 
ratio 


E66 


N-cvclohexvl-2 4 6-trimethvlbenT^amide 


2.86 






E67 


OH 

2-hydroxy-4-methyl-N-(4- 
methylcyclohexyl)benzamide 


2.98 






E68 


F. 

N-(heptan^ 


l-yl)-3-(triflubrometbyl)benzamide 


3.03 


033 




E69 


F 0 

2,3,5,6-tetrafluoro-N-isobutyl-4-metbylbenzaimde 


3.19 






E70 


F 0 ' 

2,3,5,6-tetrafluoro-4-methyl-N-(5-methylhexan-2- 
yl)benzamide 


3.2 






E71 


N-(2-methylcyclohexyl)benzo[c][l,2,5]oxadiazole-5- 
carboxamide 


3.33 






E72 


OH 0 k^^J 

2 -hydroxy-3 -methoxy-N-(4- 
methylcyclohexyl)benzamide 




3.35 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
EC50 
uM 


Umami 
EC50 
ratio 


E73 


Thiophene-2-carboxylic acid(l,3,3-trimethyl- 
bicyclo[2.2. l]hept-2-yl)-amide 


3.36 






E74 


f > *r^ f — 

F 

N-(2,3-dimethylcyclohexyl)-2- 
(perfluorophenyl)acetamide 


3.62 






E75 


CI 0 k 

2,3-dichloro-N-(pentan-3-yl)benzamide 


3.78 






E76 


ci o 

2,3^chloro-N-(2,3-dimethylcyclohexyl)benzamide 


3.99 






E77 


F 

N-(2,3 -dimethylcyclohexyl) -2 , 5 -difluorobenzamide 


4.11 






E78 


4,5-Dichloro-isothiazole-3-carboxylic acid (2-methyl- 
cyclohexyl)-amide 


4.24 


8.51 
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Table S - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
EC50 
uM 


Umami 
EC50 
ratio 


E79 


OH 0 

N-(2,4-dimethylpentaii-3-yl)-2 ) 6- 
dihydroxybenzamide 


4.28 






E80 


3-chlorO"2-methyl-N-(2-methylcyclohexyl)benzamide 


4.29 






E81 


o 

3,4-difluoro-N-f2-niethylcyclohexyl)ben2amide 


. 4.37 




6.98' 


E82 


j, j~\iLiiiCLLiyi~iN "^-njcinyicycioiiCAyi ) DciizainiQc 


4.48 






B83 


N-(4-etlioxyphenethyl)- 1 -methyl- 1 H-pyrazole-5- 
carboxamide 


4.68 






E84 


3 > 6-dichloro-N-(2-fluorophenyl)-2- 
methoxybeuzamide 


0.83 




16.51 
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Table 5 - Sweet Enhancer Amides 



Compound 
No. 



Compound 



Sweet 
EC50 
uM 



Umami 
EC50 
uM 



Umami 
EC50 
ratio 



E85 




N-(2-Chloro-4,6-dimethoxy-phenyl)-3- 
trifluorometfayl-benzamide 



1.42 



E86 




3,5^chloro-N^2,4-^limethylphenyl)-4- 
methoxybenzamide 



1.48 



E87 




3-Chloro-4-fluoro-N-(5 -trifluoromethyl- 
[1 3,4]thiadiazol-2-yl)-benzamide 



1.55 



E88 




3,5Kiichloro^-methoxy-N-o-tolylbenzamide 



1.84 



E89 




2.56 



5-Chloro-N-(2,4-difluoro-phenyl)-24iydroxy- 
benzamide 



E90 




2.71 



2 > 4-Dichloro-N>(2-cyano-3-fluoro-phenyl)-ben2amide 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


T Jmami 
EC50 
uM 


T Jmami 

ECso 
ratio 
















XX XT 




Z. /4 








2,6-Dichloro-N-(4-cyano-phenyl)-benzamide 










0 

lT^i^ NH 










E92 






Z. /*t 








4-cMoro-N-(2,4-dimethylphenyl)-3-methylbenzamide 






























E93 


Ci 

. 3,5-dicMoro-4-methoxy-N-(4- 
methoxypheigl)benzamide 




3.24 


















E94 


ci l^JI 




3.56 








3-chloro-N-(2,4-dimethoxyphenyl)-4- 
fluorobenzamide 








E95 






3.58 








5-Cyano-2,4-dimethyl-6-methylsulfanyl-N-phenyl- 
nicotinamide 




















E96 






3.73 








N-(4-tert-Butyl-thiazol-2-yl)-isomcotinaniide 
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Table S - Sweet Enhancer Amides 




Compound 
No. 


Compound 


Sweet 
uM 


Umami 
EC50 
uM 


Umami 
ECso 
ratio 


E97 


S^-DicWoro-N^z^-dimethyl-phenylj-z-metnoxy- 
benzamide 


4.25 






E98 


N -(o-etnylpliciiyij-z-inetnoxy-o-m 


4.63 






E99 


N-(4-bronio-2,6-dimethylphenyl)isoindoline-2- 
carboxamide 


0.93 






E100 


C 

1 


^-(2-metbyl-4-nitrophenyl)isoindoline-2- 
carboxamide 


1.3 






E101 


N-(2,4-difluorophenyl)isoindoline-2-carboxamide 


1.37 






E102 


Q~~l H I 1 

YYj v 

N«<2-methyl-3-nitrophenyl)isoindoline-2- 
carboxamide 


2.01 






E103 


N-(2, 


3 1 4-trifluorophenyl)isomdolme-2-carboxarnide 


2.58 
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I Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


U id ami 
EC50 
uM 


Umaml 
EC50 
ratio 




N-p-tolylisoindoline-2-carboxamide 








E105 


OK 0 "" 

N-(4-chlorophenyl)isoindoline-2-carboxamide 


3.4 






E106 


CI 

N-(2-chlorophenyl)isoindoline-2-caiboxamide 


3.85 






E107 


CI 

N-(2,4-dichlorophenyl)isoindoline-2-caiboxamide 


4.15 






E108 


N-r4-methox\^henvlMsoindolme-2-carboxainide 


4.99 






E109 


^ — ^ 0 

N-(2 > 4-dicUorophenyl)-3,4-dihyckoisoquinoiine- 
2(lH)-carboxamide 


2.34 






E110 


N-(2-cyanophenyl)-3,4-dihydroisoquinoline-2(lH)- 
carboxamide 


2.5 






BUI 


N-p-tolyl-3,4-4ihydroisoquinoline-2(lH)- 
carboxamide 


4.27 
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Table 5 - Sweet Enhancer Amides 




Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
EC50 
uM 


Umami 

EC50 
ratio 


El 12 


N^^hloro^-methylphenyl)^- 
dihydroisoquinoline-2( lH)-carboxamide 


4.33 






El 13 


0 

0-a n _^ n ^Q- o/ 

N-(2,4-dimethoxyphenyl)^,4-dihydroisoquinoline- 
2flH)-carboxamide 


4.44 
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Also, as supported by experimental data in the examples, it has been shown that 
cell lines which co-express T1R1/T1R3 or T1R2/T1R3 respectively respond to umami 
or sweet taste stimuli and a quantitative dose-responsive manner which further suppprts 
a conclusion that specific binding to the T1R1/T1R3 and T1R2/T1R3 receptor can be 
5 used to define receptor agonists and antagonists, e.g., MSG substitutes, umami 
blockers, novel artificial and natural sweeteners, and sweet blockers. 

Also, as supported by data in experimental examples, it has been shown that the 
sweet taste blocker lactisole inhibits both the T1R2/T1R3 sweet receptor and the 
T1R1/T1R3 umami taste receptor. Compounds are provided herein that enhance, 
10 mimic, modulate or block sweet or umami taste. The fact that lactisole inhibits both the 
T1R1/T1R3 and T1R2/T1R3 receptors suggests that these receptors may share a 
common subunit which is bound by lactisole and potentially other taste modulators. 
Therefore, this shows that some compounds which enhance, mimic, modulate or block 
sweet taste can have a similar effect on umami taste or vice versa. 
1 5 Further, as supported by data in experimental examples, it has been 

demonstrated that cell lines which stably co-express TIRs, i.e. T1R1/T1R3 or 
T1R2/T1R3, when assayed by automated fluorescence imaging very effectively 
respond to various sweet and umami taste stimuli, i.e. at magnitudes substantially 
greater than transiently transfected cells. Thus, these cell lines are especially well 
20 suited for use in high throughput screening assays for identifying compounds that 

modulate, block, mimic or enhance sweet or umami taste. However, the invention also 
encompasses assays that utilize cells that transiently express a T1R or combination 
thereof. 

Moreover, while the application contains data demonstrating that some TIRs 
25 act in combination, particularly T1R1/T1R3 and T1R2/T1R3, and that such receptor 
combinations may be used in assays, preferably high throughput assays, it should be 
noted that the subject invention also encompasses assays that utilize T1R1, T1R2 and 
T1R3 alone or in combination with other proteins, e.g., other GPCRs. 

There are differences in human and rodent sweet taste in terms of the ligand 
30 specificity, G protein coupling efficiency, as well as sensitivity to inhibitors. The 
species differences in T1R ligand specificity can be utilized to demonstrate that the 
sweet taste receptor indeed functions as a heteromeric complex, and that there is more 
than one ligand binding site on the receptor. Furthermore, a functional link between 
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the sweet and umami receptors mediated by T1R3 has been shown (Example 16). 

Both human and rat sweet receptors can efficiently couple to a chimeric Gal 5 
with the C-terminal tail sequence from Goii (G a i5/u). For example, human but not rat 
T1R2/T1R3 selectively responds to a group of sweeteners, including aspartame, 
5 neotame, and cyclamate. This is consistent with taste physiology data. These 
differences in agonist specificity can be utilized to map their binding sites on the 
receptor. A chimeric T1R can be generated between human and rat genes, with a 
junction immediately before the transmembrane domain. Each T1R chimera therefore 
consists of two halves, the N-terminal extracellular domain, and the C-terminal 

10 transmembrane and intracellular domain, from different species. For example, a 

chimeric T1R2, termed T1R2-R, has a sequence from the N-terminus of human T1R2 
fused to rat T1R2 C-terminal sequence. Responses to these chimeras can then be 
tested (Figure 22). 

Novel compounds and novel flavor, tastants, and sweet enhancers were 

1 5 discovered in the chemistry series of amide derivatives. The amide compounds also 
comprise certain sub-classes of amide derivatives or classes of derivatives related to 
amides, such as for example ureas, urethanes, oxalamides, acrylamides, and the like. 
These compounds, when used together with sucrose or alone, increase a response in 
vitro and concomitant increase in sweet perception in human tasting. These 

20 compounds enhance other natural and synthetic sweet tastants. Examples of these 
compounds are listed in Table 5. 

In one embodiment, the invention provides novel compounds, flavorants, 
tastants, flavor enhancers, taste enhancers, flavor modifying compounds, and/or 
compositions containing them. 

25 In a more specific embodiment, the invention provides novel sweet flavorants, 

sweet tastants, sweet taste enhancers, and sweet taste modifiers and compositions 
containing them. 

More particularly, in another embodiment, the invention is directed to 
compounds that modulate, induce, enhance, or inhibit natural or synthetic sweet 
30 tastants, e.g., naturally occurring and synthetic sweeteners. 

In another embodiment, the invention provides compositions, preferably, 
compositions suitable for human or animal consumption, containing at least one 
compound of the invention. These compositions include foods, beverages and 
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medicinals, and food additives which when added to foods, beverages or medicinals 
modulate the flavor or taste thereof, particularly by enhancing the sweet taste thereof. 

Another embodiment of the invention is directed to use of a compound of the 
invention to modulate the sweet taste of a desired food, beverage or medicinal, which 
5 composition may comprise one or more other compounds that elicit a sweet taste.These 
compounds, when they were used together with naturally occurring and synthetic 
sweeteners, not only increased a response in Vitro but also intensified the sweet and 
other flavor or taste perceptions in human tasting. These specific compounds, when 
they were used together with sweet tastants, such as naturally occurring and synthetic 

10 sweeteners, not only increased the T1R2/T1R3 response in vitro but also intensified the 
sweet taste and other flavor or taste perceptions in human tasting. 

Novel compounds and novel flavor, tastant, and umami enhancers and tastants 
such as amides, ureas, amino-amides, amido-amides, and P -lactams are also disclosed 
herein. These compounds, when used together with MSG or alone, increase a response 

1 5 in vitro and the umami perception in human tasting. These compounds also enhance 
other natural and synthetic umami tastants. Examples of these compounds are listed in 
Tables 1-4. 

In one embodiment, the invention provides novel compounds, flavorants, 
tastants, flavor enhancers, taste enhancers, flavor modifying compounds, and/or 
20 compositions containing them. 

In a more specific embodiment, the invention provides novel umami flavorants, 
umami tastants, umami taste enhancers, and umami taste modifiers and compositions 
containing them. 

More particularly, in another embodiment, the invention is directed to 
25 compounds that modulate (induce, enhance or inhibit) natural or synthetic umami 
tastants, e.g., monosodium glutamate (MSG). 

In another embodiment, the invention provides compositions, preferably 
compositions suitable for human or animal consumption, containing at least one 
compound of the invention. These compositions include foods, beverages and 
30 medicinals, and food additives which when added to foods, beverages or medicinals 
modulate the flavor or taste thereof, particularly by enhancing the umami taste thereof. 

Another embodiment of the invention is directed to use of a compound of the 
invention to modulate the umami taste of a desired food, beverage or medicinal, which 
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composition may comprise one or more other compounds that elicit a umami taste, 
e.g., MSG. These compounds, when they were used together with MSG, not only 
increased a response in vitro but also intensified the umami and other flavor or taste 
perceptions in human tasting. These specific compounds, when they were used together 

5 with umami tastants, such as MSG, not only increased the T1R1/T1R3 response in vitro 
but also intensified the umami taste and other flavor or taste perceptions in human 
tasting. Some of the compounds, when they were tasted alone, elicited human 
perception of umami. 

Compounds defined by specific binding to specific receptors using the 

1 0 present T1R assays can be used to modulate the taste of foods and beverages. 
Suitable assays described in further detail infra include by way of example whole- 
cell assays and biochemical assays, including direct-binding assays using one of a 
combination of different T1R receptors, chimeras or fragments thereof; especially 
fragments containing N-terminal ligand-binding domains. Examples of assays 

1 5 appropriate for use in the invention are described in greater detail infra and are 
known in the GPCR field. 

Assays can be designed that quantitate the binding of different compounds or 
mixtures of compounds to T1R taste receptors or T1R taste receptor combinations or 
T1R receptors expressed in combination with other heterologous (non-TIR) proteins, 

20 e.g. other GPCRs, or that quantitate the activation of cells that express T1R taste 
receptors. This can be effected by stably or transiently expressing taste receptors in 
heterologous cells such as HEK-293, CHO and COS cells. Thus, this physico- 
chemical characteristic of the compounds is used to define a genus of compound that 
share this characteristic. 

25 The assays will preferably use cells that also express (preferably stably) a G 

protein such as Gal 5 or Gal 6 or other promiscuous G proteins or G protein variants, or 
an endogenous G protein. In addition, Gp and G y proteins may also be expressed 
therein. 

The effect of a compound on sweet or umami taste using cells or compositions 
30 that express or contain the above-identified receptors or receptor combinations may be 
determined by various means including the use of calcium-sensitive dyes, voltage- 
sensitive dyes, cAMP assays, direct binding assays using fluorescently labeled ligands 
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or radioactive ligands such as 3 H-glutamate, or transcriptional assays (using a suitable 
reporter such as luciferase or beta-lactamase). 

Assays that may be utilized with one or more TIRs according to the invention 
include by way of example, assays that utilize a genetic selection for living cells; assays 

5 that utilize whole cells or membrane fragments or purified T1R proteins; assays that 
utilize second messengers such as cAMP and IP3, assays that detect the translocation of 
airestin to the cell surface, assays that detect the loss of receptor expression on the cell 
surface (internalization) by tested ligands, direct ligand-binding assays, competitive- 
binding assays with inhibitors, assays using in vitro translated protein, assays that 

1 0 detect conformational changes upon the binding of a ligand (e.g., as evidenced by 
proteolysis, fluorescence, or NMR), behavioral assays that utilize transgenic non- 
human animals that express a T1R or T1R combination, such as flies, worms, or mice, 
assays that utilize cells infected with recombinant viruses that contain T1R genes. 

Also within the scope of the invention are structure-based analyses wherein the 

15 X-ray crystal structure of a T1R or T1R fragment (or combination of TIRs, or a 
combination of a T1R with another protein) is determined and utilized to predict by 
molecular modeling techniques compounds that will bind to and/or enhance, mimic, 
block or modulate the particular T1R receptor or receptor combination. More 
particularly, the invention embraces the determination of the crystal structure of 

20 T1R1/T1R3 (preferably hTlRl/hTlR3) and/or T1R2/T1R3 (preferably hTlR2/hTlR3) 
and the use of such crystal structures in structure-based design methods to identify 
molecules that modulate T1R receptor activity. 

The invention especially includes biochemical assays conducted using cells, 
e.g., mammalian, yeast, insect or other heterologous cells that express one or more full 

25 length T1R receptors or fragments, preferably N-terminal domains of T1R1, T1R2 
and/or T1R3 . The effect of a compound in such assays can be determined using 
competitive binding assays, e.g., using radioactive glutamate or IMP, fluorescence 
(e.g., fluorescence polarization, FRET), or GTPy 35 S binding assays. As noted, in a 
preferred embodiment, such assays will utilize cell lines that stably co-express 

30 T1R1/T1R3 or T1R2/T1R3 and a suitable G protein, such as G a is. Other appropriate G 
proteins include the chimeric and variant G proteins disclosed in U.S. Application 
Serial No. 09/984,292 and 60/243,770, incorporated by reference in their entirety 
herein. 
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Still further, altered receptors can be constructed and expressed having 
improved properties, e.g., enhanced surface expression or G-protein coupling. These 
T1R variants can be incorporated into cell-based and biochemical assays. 

It is envisioned that the present discoveries relating to human TIRs will extend 
5 to other species, e.g., rodents, pigs, monkeys, dogs and cats, and perhaps even non- 
mammals such as fish. In this regard, several fish T1R fragments are identified infra in 
Example 1 . Therefore, the subject invention has application in screening for 
compounds for use in animal feed formulations. 

The invention further includes that utilize different allelic variants of various 

10 TIRs and combinations thereof, thereby enabling the identification of compounds that 
elicit specific taste sensation in individuals that express those allelic variants or 
compounds that elicit specific taste sensations in all individuals. Such compounds can 
be used to make foods more generally palatable. 

T1R encoding nucleic acids also provide valuable probes for the identification 

15 of taste cells, as the nucleic acids are specifically expressed in taste cells. For example, 
probes for T1R polypeptides and proteins can be used to identify taste cells present in 
foliate, circumvallate, and fungiform papillae, as well as taste cells present in the 
geschmackstreifen, oral cavity, gastrointestinal epithelium, and epiglottis. In particular, 
methods of detecting TIRs can be used to identify taste cells sensitive to sweet and/or 

20 umami taste stimuli or other taste stimuli representing other taste modalities. For 

example, cells stably or transiently expressing T1R2 and/or T1R3 would be predicted 
from the work herein to be responsive to sweet taste stimuli. Similarly, cells expressing 
T1R1 and/or T1R3 would be predicted to be responsive to umami taste stimuli. The 
nucleic acids encoding the T1R proteins and polypeptides of the invention can be 

25 isolated from a variety of sources, genetically engineered, amplified, synthesized, 

and/or expressed recombinantly according to the methods disclosed in WO 00/035374, 
which is herein incorporated by reference in its entirety. A listing of TIRs that may be 
expressed according to the invention are provided in the Examples. However, it should 
be emphasized that the invention embraces the expression and use of other specific 

30 TIRs or fragments, variants, or chimeras constructed based on such T1R sequences, 
and particularly TIRs of other species. 

As disclosed, an important aspect of the invention is the plurality of methods of 
screening for modulators, e.g., activators, inhibitors, stimulators, enhancers, agonists, 



63 



WO 2005/015158 



PCT/US2004/025459 



and antagonists, of these taste-cell-specific GPCRs. Such modulators of taste 
transduction are useful for the modulation of taste signaling pathways. These methods 
of screening can be used to identify high affinity agonists and antagonists of taste cell 
activity. These modulatory compounds can then be used in the food industry to 
5 customize taste, e.g. , to modulate the sweet and/or umami tastes of foods. 

This invention rectifies the previous lack of understanding relating to sweet and 
umami taste as it identifies specific TIRs and T1R receptor combinations that mediate 
sweet and umami taste sensation. Therefore, in general, this application relates to the 
inventors' discoveries relating to the T1R class of taste-specific G-protein-coupled 
1 0 receptors and their specific function in taste perception and the relationship of these 
discoveries to a better understanding of the molecular basis of taste. 

The molecular basis of sweet taste and umami taste - the savor of monosodium 
glutamate - is enigmatic. Recently, a three-member class of taste-specific G-protein- 
coupled receptors, termed TIRs, was identified. Overlapping T1R expression patterns 
1 5 and the demonstration that the structurally related GAB A B receptor is heterodimeric 
suggest that the TIRs function as heterodimeric taste receptors. In the examples infra, 
the present inventors describe the functional co-expression of human T1R1, T1R2, and 
T1R3 in heterologous cells; cells co-expressing T1R1 and T1R3 are activated by 
umami taste stimuli; cells co-expressing T1R2 and T1R3 are activated by sweet taste 
20 stimuli. T1R1/T1R3 and T1R2/T1R3 activity correlated with psychophysical detection 
thresholds. In addition, the 5 '-ribonucleotide IMP was found to enhance the 
T1R1/T1R3 response to glutamate, a synergism characteristic of umami taste. These 
findings demonstrate that specific TIRs and particularly different combinations of the 
TIRs function as sweet and umami taste receptors. 
25 Human perception of bitter, sweet, and umami is thought to be mediated by G- 

protein-coupled receptors (Lindemann, B., Physiol Res. 76:71 8-66 (1996)). Recently, 
evaluation of the human genome revealed the T2R class of bitter taste receptors (Adler 
et al., Cell 100:613-702 (2000); Chandrasgekar et al., Cell 100:703-11 (2000); 
Matsunami et al., Nature 404: 601-604 (2000)) but the receptors for sweet and umami 
30 taste have not been identified. Recently, another class of candidate taste receptors, the 
TIRs, was identified. The TIRs were first identified by large-scale sequencing of a 
subtracted cDNA library derived from rat taste tissue, which identified T1R1, and 
subsequently by T1R1 -based degenerate PCR, which led to the identification of T1R2 
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(Hoon et al., Cell 96:541-55 1 (1999)). Recently, the present inventors and others 
identified a third and possibly final member of the T1R family, T1R3, in the human 
genome databank (Kitagawa et al., Biochem Biophys. Res Commun. 283(1): 236-42 
(2001); Max et al., Nat Genet 28(1): 58-63 (2001); Sainz et al., J. Neurochem. 77(3): 
5 896-903 (2001); Montmayeur et al., Nat Neurosci 4, 492-8. (2001)). Tellingly, mouse 
T1R3 maps to a genomic interval containing Sac, a locus that influences sweet taste in 
the mouse (Fuller et al, J. Hered. 65:33-6 (1974); Li et al., Mamm. Genome 12: 13-16 
(2001)). Therefore, T1R3 was predicted to function as a sweet taste receptor. Recent 
high-resolution genetic mapping studies have strengthened the connection between 

10 mouse T1R3 and Sac (Fuller T.C., X Hered. 65(1): 33-36 (1974); Li et al., Mammal 
Genome 12(1): 13-16 (2001)). 

Interestingly, all C-family receptors that have been functionally expressed thus 
far-metabotropic glutamate receptors, the GABA B receptor, the calcium-sensing 
receptor (Conigrave, A. D., Quinn, S. J. & Brown, E. M., Proc Natl Acad Sci USA 97, 

15 4814-9. (2000)), and a fish olfactory receptor (Speca, D. J. et al., Neuron 23, 487-98. 

(1999)) - have been shown to be activated by amino acids. This common feature raises » 
the possibility that the TIRs recognize amino acids, and that the TIRs may be involved 
in the detection of glutamate in addition to sweet-tasting amino acids. Alternatively, a 
transcriptional variant of the mGluR4 metabotropic glutamate receptor has been 

20 proposed to be the umami taste receptor because of its selective expression in rat taste 
tissue, and the similarity of the receptor-activation threshold to the glutamate 
psychophysical detection threshold (Chaudhari et al., Nat Neurosci 3:113-119 (2000)). 
This hypothesis is difficult to reconcile with the exceedingly low expression level of 
the mGluR4 variant in taste tissue, and the more or less unaltered glutamate taste of 

25 mGluR4 knockout mice (Chaudhari and Roper, Ann. NY. Acad. Scl 855:398-406 

(1998)). Furthermore, the taste variant is structurally implausible, lacking not only the 
majority of the residues that form the glutamate-binding pocket of the wild-type 
receptor, but also approximately half of the globular N-terminal glutamate-binding 
domain (Kunishima et al., Nature 407:971-7 (2000)). 

30 Comparative analysis of T1R expression patterns in rodents has demonstrated 

that T1R2 and possibly T1R1 are each coexpressed with T1R3 (Hoon et al., Cell 
96:541-51 (1999); Kitagawa et al, Biochem Biophy. Res. Commun. 283:236-242 
(2001); Max et al., Nat. Genet. 28:58-63 (2001); Montmayeur et al., Nat Neurosci 



65 



WO 2005/015158 



PCT/US2004/025459 



4:492-8 (2001); Sainz et al, J. Neurochem 77:896-903 (2001)). Furthermore, 
dimerization is emerging as a common theme of C-family receptors: the metabotropic 
glutamate and calcium-sensing receptor are homodimers (Romomano et al., J. Biol 
Chem. 271:28612-6 (1996); Okamoto et al., J. Biol Chem. 273: 13089-96 (1998); Han 
5 et al., J. Biol Chem. 274:100008-13 (1999); Bai et al., Biol Chem. 273:23605-10 
(1998)), and the structurally related GABA B receptor is heterodimeric (Jones et al., 
Nature 396:674-9 (1998); Kaupmann et al., Nature 396:683-687 (1998); White et al., 
Nature 396: 679-682 (1998); Kuner et al., Science 283:74-77 (1999)). The present 
inventors have demonstrated by functional coexpression of TIRs in heterologous cells 
1 0 that human T1R2 functions in combination with human T1R3 as a sweet taste receptor 
and that human T1R1 functions in combination with human T1R3 as an umami taste 
receptor. 

The discoveries discussed herein are especially significant, as previously the 
development of improved artificial sweeteners has been hampered by the lack of assays 

15 for sweet taste. Indeed, the five commonly used commercial artificial sweeteners, all of 
which activate hTlR2/hTlR3, were discovered serendipitously. Similarly, other than 
sensory testing, a laborious process, there is no assay for identifying compounds that 
modulate umami taste. These problems are now alleviated because, as established by 
experimental results discussed infra, the human sweet and umami receptors have been 

20 identified, and assays for these receptors have been developed, particularly assays that 
use cells that stably express a functional T1R taste receptor, i.e. the sweet or umami 
taste receptor. 

Based thereon the invention provides assays for detecting and characterizing 
taste-modulating compounds, wherein T1R family members act, as they do in the taste 

25 bud, as reporter molecules for the effect on sweet and umami taste of taste-modulating 
compounds. Particularly provided and within the scope of the invention are assays for 
identifying compounds that modulate, mimic, enhance and/or block individually, sweet 
and umami tastes. Methods for assaying the activity of GPCRs, and especially 
compounds that affect GPCR activity are well known and are applicable to the T1R 

30 family member of the present invention and functional combinations thereof. Suitable 
assays have been identified supra. 
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The invention also provides compounds that bind T1R1, T1R2, T1R3, 
T1R2/T1R3 or T1R1/T1R3, or any fragment, portion, or subunit thereof, as disclosed 
throughout. 

In particular, the subject GPCRs can be used in assays to, e.g„ measure changes 
5 in ligand binding, ion concentration, membrane potential, current flow, ion flux, 
transcription, receptor-ligand interactions, second messenger concentrations, in vitro 
and in vivo. In another embodiment, T1R family members may be recombinantly 
expressed in cells, and the modulation of taste transduction via GPCR activity may be 
assayed by measuring changes in Ca 2+ levels and other intracellular messages such as 

10 cAMP, cGMP, or IP 3 . 

In certain assays, a domain of a T1R polypeptide, e.g., an extracellular, 
transmembrane, or intracellular domain, is fused to a heterologous polypeptide, thereby 
forming a chimeric polypeptide, e.g., a chimeric protein with GPCR activity. 
Particularly contemplated is the use of fragments of T1R1, T1R2 or T1R3 containing 

15 the N-terminal ligand-binding domain. Such proteins are useful, e.g., in assays to 
identify ligands, agonists, antagonists, or other modulators of T1R receptors. For 
example, a T1R polypeptide can be expressed in a eukaryotic cell as a chimeric 
receptor with a heterologous, chaperone sequence that facilitates plasma membrane 
trafficking, or maturation and targeting through the secretory pathway. The optional 

20 heterologous sequence may be a PDZ domain-interacting peptide, such as a C-terminal 
PDZIP fragment (SEQ ID NO 1). PDZIP is an ER export signal, which, according to 
the present invention, has been shown to facilitate surface expression of heterologous 
proteins such as the T1R receptors described herein. More particularly, in one aspect of 
the invention, PDZIP can be used to promote proper targeting of problematic 

25 membrane proteins such as olfactory receptors, T2R taste receptors, and the T1R taste 
receptors described herein. 

Examples of such chimeric receptors include trans-species receptors. Any 
combination of receptor subunits from various species can be used together to form a 
chimeric receptor, which can then be used to identify tastants, for example. Therefore, 

30 contemplated herein is a chimeric T1R2/T1R3 receptor comprising a human T1R2 
subunit and a rat T1R3 subunit. Also contemplated is a chimeric T1R2/T1R3 receptor 
comprising, a rat T1R2 subunit and a human T1R3 subunit. Also contemplated is a 
chimeric T1R2 receptor subunit comprising, a human extracellular domain, a rat 
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transmembrane domain and a rat intracellular domain (SEQ ID NOS: 16 and 17, for 
example). Also contemplated is chimeric T1R3 receptor subunit comprising, a rat 
extracellular domain, a human transmembrane domain and a human intracellular 
domain (SEQ ID NOS: 18 and 19, for example.) 

5 Such chimeric T1R receptors can be expressed in any eukaryotic cell, such as 

HEK-293 cells. Preferably, the cells contain a G protein, preferably a promiscuous G 
protein such as G a is or G a i6 or another type of promiscuous G protein capable of 
linking a wide range of GPCRs to an intracellular signaling pathway or to a signaling 
protein such as phospholipase C. Activation of such chimeric receptors in such cells 

10 can be detected using any standard method, such as by detecting changes in 

intracellular calcium by detecting FURA-2 dependent fluorescence in the cell. If 
preferred host cells do not express an appropriate G protein, they may be transfected 
with a gene encoding a promiscuous G protein such as those described in U.S. 
Application Serial No. 60/243,770, U.S. Application Serial No. 09/984,297, filed 

15 October 29, 2001, and U.S. Application Serial No. 09/989,497 filed November 21, 
2001 which are herein incorporated by reference in its entirety. 

Additional methods of assaying for modulators of taste transduction include in 
vitro ligand-binding assays using: T1R polypeptides, portions thereof, the 
extracellular domain, transmembrane region, or combinations thereof, or chimeric 

20 proteins comprising one or more domains of a T1R family member; oocyte or tissue 
culture cells expressing T1R polypeptides, fragments, or fusion proteins; 
phosphorylation and dephosphorylation of T1R family members; G protein binding to 
GPCRs; ligand-binding assays; voltage, membrane potential and conductance changes; 
ion flux assays; changes in intracellular second messengers such as cGMP, cAMP and 

25 inositol triphosphate (IP3); and changes in intracellular calcium levels. 

Further, the invention provides methods of detecting T1R nucleic acid and 
protein expression, allowing investigation of taste transduction regulation and specific 
identification of taste receptor cells. T1R family members also provide useful nucleic 
acid probes for paternity and forensic investigations. T1R genes are also useful as 

30 nucleic acid probes for identifying taste receptor cells, such as foliate, fungiform, 

circumvallate, geschmackstreifen, and epiglottis taste receptor ceils. T1R receptors can 
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also be used to generate monoclonal and polyclonal antibodies useful for identifying 
taste receptor cells. 

Functionally, the T1R polypeptides comprise a family of related seven 
transmembrane G protein-coupled receptors, which are believed to be involved in taste 
5 transduction and may interact with a G protein to mediate taste signal transduction (see, 
e.g., Fong, Cell Signal, 8:217 (1996); Baldwin, Curr. Opin. Cell Biol, 6:180 (1994)). 
Structurally, the nucleotide sequences of T1R family members encode related 
polypeptides comprising an extracellular domain, seven transmembrane domains, and a 
cytoplasmic domain. Related T1R family genes from other species share at least about 
10 50%, and optionally 60%, 70%, 80%, or 90%, nucleotide sequence identity over a 
region of at least about 50 nucleotides in length, optionally 100, 200, 500, or more 
nucleotides in length to the T1R nucleic acid sequences disclosed herein in the 
Examples, or conservatively modified variants thereof, or encode polypeptides sharing 
at least about 35 to 50%, and optionally 60%, 70%, 80%, or 90%, amino acid sequence 
15 identity over an amino acid region at least about 25 amino acids in length, optionally 50 
to 100 amino acids in length to a T1R polypeptide sequence disclosed infra in the 
Examples conservatively modified variants thereof. 

Several consensus amino acid sequences or domains have also been identified 
that are characteristic of T1R family members. For example, T1R family members 
20 typically comprise a sequence having at least about 50%, optionally 55%, 60%, 65%, 
70%, 75%, 80%, 85%, 90%, 95-99%, or higher, identity to T1R consensus sequences 1 
and 2 (SEQ ID NOs. 2 and 3, respectively). These conserved domains thus can be used 
to identify members of the T1R family, by identity, specific hybridization or 
amplification, or specific binding by antibodies raised against a domain. T1R 
25 consensus sequences include by way of example the following sequences: 
T1R Family Consensus Sequence, 1 : (SEQ ID NO: 2) 
(TR)C(FL)(RQP)R(RT)(SPV)(VERKT)FL(AE)(WL)(RHG)^ 
T1R Family Consensus Sequence 2: (SEQ ID NO: 3) 
(LQ)P(EGT)(NRC)YN(RE)^ 
30 These consensus sequences are inclusive of those found in the T1R 

polypeptides described herein, but T1R family members from other organisms maybe 
expected to comprise consensus sequences having about 75% identity or more to the 
inclusive consensus sequences described specifically herein. 
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Specific regions of the T1R nucleotide and amino acid sequences maybe used 
to identify polymorphic variants, interspecies homologs, and alleles of T1R family 
members. This identification can be made in vitro, e.g., under stringent hybridization 
conditions or PCR (e.g., using primers encoding the T1R consensus sequences 
5 identified above), or by using the sequence information in a computer system for 
comparison with other nucleotide sequences. Different alleles of T1R genes within a 
single species population will also be useful in deteimining whether differences in 
allelic sequences control differences in taste perception between members of the 
population. Classical PCR-type amplification and cloning techniques are useful for 
10 isolating new TIRs, for example, where degenerate primers are sufficient for detecting 
related genes across species. 

Typically, identification of polymorphic variants and alleles of T1R family 
members can be made by comparing an amino acid sequence of about 25 amino acids 
or more, e.g., 50-100 amino acids. Amino acid identity of approximately at least 35 to 
15 50%, and optionally 60%, 70%, 75%, 80%, 85%, 90%, 95-99%, or above typically 
demonstrates that a protein is a polymorphic variant, interspecies homolog, or allele of 
a T1R family member. Sequence comparison can be performed using any of the 
sequence comparison algorithms discussed below. Antibodies that bind specifically to 
T1R polypeptides or a conserved region thereof can also be used to identify alleles, 
20 interspecies homologs, and polymorphic variants. 

Polymorphic variants, interspecies homologs, and alleles of T1R genes can be 
confirmed by examining taste-cell-specific expression of the putative T1R gene or 
protein. Typically, T1R polypeptides having an amino acid sequence disclosed herein 
can be used as a positive control in comparison to the putative T1R polypeptide to 
25 demonstrate the identification of a polymorphic variant or allele of the T1R family 

member. The polymorphic variants, alleles, and interspecies homologs are expected to 
retain the seven transmembrane structure of a G protein-coupled receptor. For further 
detail, .see WO 00/06592, which discloses related T1R family members, GPCR-B3s, 
the contents of which are herein incorporated by reference in a manner consistent with 
30 this disclosure. GPCR-B3 receptors are referred to herein as rTlRl and mTlRl . 
Additionally, see WO 00/06593, which also discloses related T1R family members, 
GPCR-B4s, the contents of which are herein incorporated by reference in a manner 
consistent with this disclosure. GPCR-B4 receptors are referred to herein as rTlR2 and 
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mTlR2. As discussed previously, the invention also includes structure-based assays 
that utilize the x-ray crystalline structure of a T1R or T1R combination, e.g., 
hTlR2/hTlR3 or hTlRl/hTlR3, to identify molecules that modulate T1R receptor 
activity, and thereby modulate sweet and/or umami taste. 
5 The present invention also provides assays, preferably high throughput assays, 

to identify molecules that enhance, mimic, block and/or modulate T1R receptors. In 
some assays, a particular domain of a T1R family member is used in combination with 
a particular domain of another T 1 R family member, e.g. , an extracellular, 
transmembrane, or intracellular domain or region. In other embodiments, an 

10 extracellular domain, transmembrane region or combination thereof may be bound to a 
solid substrate, and used, e.g, to isolate ligands, agonists, antagonists, or any other 
molecules that can bind to and/or modulate the activity of a T1R polypeptide. 

Various conservative mutations and substitutions are envisioned to be within the 
scope of the invention. For instance, it is within the level of skill in the art to perfoim 

15 amino acid substitutions using known protocols of recombinant gene technology 

including PCR, gene cloning, site-directed mutagenesis of cDNA, transfection of host 
cells, and in-vitro transcription. The variants could then be screened for activity. 

Definitions 

20 As used herein, the following terms have the meanings ascribed to them unless 

specified otherwise. 

'Taste cells" include neuroepithelial cells that are organized into groups to 
form taste buds of the tongue, e.g., foliate, fungiform, and circumvallate cells {see, e.g., 
Roper et al., Ann. Rev. Neurosci. 12:329-353 (1989)). Taste cells are also found in the 

25 palate and other tissues, such as the esophagus and the stomach. 

"T1R" refers to one or more members of a family of G protein-coupled 
receptors that are expressed in taste cells such as foliate, fungiform, and circumvallate 
cells, as well as cells of the palate, and esophagus {see, e.g., Hoon et al., Cell, 
96:541-55 1 (1999), herein incorporated by reference in its entirety). Members of this 

30 family are also referred to as GPCR-B3 and TR1 in WO 00/06592 as well as GPCR-B4 
and TR2 in WO 00/06593. GPCR-B3 is also herein referred to as iTlRl, and GPCR- 
B4 is referred to as rTlR2. Taste receptor cells can also be identified on the basis of 
morphology {see, e.g., Roper, supra), or by the expression of proteins specifically 
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expressed in taste cells. T1R family members may have the ability to act as receptors 
for sweet taste transduction, or to distinguish between various other taste modalities. 
Representative T1R sequences, including hTlRl, hTlR2 and hTlR3 are identified 
infra in the examples. 

5 "T1R" nucleic acids encode a family of GPCRs with seven transmembrane 

regions that have "G protein-coupled receptor activity ," e.g. , they may bind to G 
proteins in response to extracellular stimuli and promote production of second 
messengers such as TP 3, cAMP, cGMP, and Ca 2+ via stimulation of enzymes such as 
phospholipase C and adenylate cyclase (for a description of the structure and function 

10 of GPCRs, see, e.g., Fong, supra, and Baldwin, supra). A single taste cell may contain 
many distinct T1R polypeptides. 

The term "T1R" family therefore refers to polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have at least about 35 to 50% amino acid 
sequence identity, optionally about 60, 75, 80, 85, 90, 95, 96, 97, 98, or 99% amino 

15 acid sequence identity to a T1R polypeptide, preferably those identified in Example 1, 
over a window of about 25 amino acids, optionally 50-100 amino acids; (2) specifically 
bind to antibodies raised against an immunogen comprising an amino acid sequence 
preferably selected from the group consisting of the T1R polypeptide sequence 
disclosed in Example 1 and conservatively modified variants thereof; (3) are encoded 

20 by a nucleic acid molecule which specifically hybridize (with a size of at least about 
100, optionally at least about 500-1000 nucleotides) under stringent hybridization 
conditions to a sequence selected from the group consisting of the T1R nucleic acid 
sequences contained in Example 1, and conservatively modified variants thereof; or (4) 
comprise a sequence at least about 35 to 50% identical to an amino acid sequence 

25 selected from the group consisting of the T1R amino acid sequence identified in 
Example 1. 

Topologically, the TIRs disclosed herein have an ''N-terminal domain" also 
called "extracellular domain" comprising a 'Venus flytrap domain" and a "cysteine rich 
domain;" "transmembrane domains" comprising seven transmembrane regions, and 
30 corresponding cytoplasmic, and extracellular loops; and a "C-terminal domain" {see, 
e.g., Hoon et al, Cell, 96:541-551 (1999); Buck & Axel, Cell, 65:175-187 (1991)). 
These domains have been structurally identified using methods known to those of skill 
in the art, such as sequence analysis programs that identify hydrophobic and 
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hydrophilic domains (Stryer, Biochemistry, (3rd ed. 1988). Such domains are useful for 
making chimeric proteins and for in vitro assays of the invention, e.g., ligand binding 
assays. The specific binding of a compound to these structurally defined domains 
provides provides structural definition for the compound. 

5 "Extracellular domains" therefore refers to the domains of T1R polypeptides 

that protrude from the cellular membrane and are exposed to the extracellular face of 
the cell. Such domains generally include the "N terminal domain" that is exposed to 
the extracellular face of the cell, and optionally can include portions of the extracellular 
loops of the transmembrane domain that are exposed to the extracellular face of the 

10 cell, i.e., the loops between transmembrane regions 2 and 3, between transmembrane 
regions 4 and 5, and between transmembrane regions 6 and 7. 

The "N-terminal domain" region starts at the N-terminus and extends to a 
region close to the start of the first transmembrane domain. More particularly, in one -- 
embodiment of the invention, this domain starts at the N-terminus and ends 

15 approximately at the conserved glutamic acid at amino acid position 563 plus or minus 
approximately 20 amino acids. These extracellular domains are useful for in vitro 
ligand-binding assays, both soluble and solid phase, hi addition, transmembrane 
regions, described below, can also bind ligand either in combination with the 
extracellular domain, and are therefore also useful for in vitro ligand-binding assays. 

20 "Cysteine-rich domain" refers to the domain of the polypeptides. This 

conserved sequence contains several highly-conserved Cys residues that form 
disulphide bridges, and lies outside the cell membrane.. This region corresponds to the 
domain of the T1R family members and is found in all three subunits, T1R1-T1R3. The 
cysteine rich sequence is found in amino acids 510-566 of T1R1, 508-565 of T1R2, and 

25 512-568 or T1R3. 

"Transmembrane domain," which comprises the seven "transmembrane 
regions," refers to the domain of T1R polypeptides that lies within the plasma 
membrane, and may also include the corresponding cytoplasmic (intracellular) and 
extracellular loops. In one embodiment, this region corresponds to the domain of T1R 

30 family members which starts approximately at the conserved glutamic acid residue at 
amino acid position 563 plus or minus 20 amino acids and ends approximately at the 
conserved tyrosine amino acid residue at position 812 plus or minus approximately 10 
amino acids. The seven transmembrane regions and extracellular and cytoplasmic loops 
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can be identified using standard methods, as described in Kyte & Doolittle, J. Mol 
Biol, 157:105-32 (1982)), or in Stryer, supra. 

"Cytoplasmic domains" refers to the domains of T1R polypeptides that face the 
inside of the cell, e.g., the "C-terminal domain" and the intracellular loops of the 
5 transmembrane domain, e.g., the intracellular loop between transmembrane regions 1 
and 2, the intracellular loop between transmembrane regions 3 and 4, and the 
intracellular loop between transmembrane regions 5 and 6. 

"C-terminal domain" refers to the region that spans the end of the last 
transmembrane domain and the C-terminus of the protein, and which is normally 

10 located within the cytoplasm. In one embodiment, this region starts at the conserved 
tyrosine amino acid residue at position 812 plus or minus approximately 10 amino 
acids and continues to the C-terminus of the polypeptide. 

The term "ligand-binding region" or "ligand-binding domain" refers to 
sequences derived from a taste receptor, particularly a taste receptor that substantially •. 

1 5 incorporates at least the extracellular domain of the receptor. In one embodiment, the 
extracellular domain of the ligand-binding region may include the N-terminal domain 
and, optionally, portions of the transmembrane domain, such as the extracellular loops 
of the transmembrane domain. The ligand-binding region may be capable of binding a 
ligand, and more particularly, a compound that enhances, mimics, blocks, and/or 

20 modulates taste, e.g., sweet or umami taste. 

The phrase 'Tieteromultimer" or "heteromultimeric complex" in the context of 
the T1R receptors or polypeptides of the invention refers to a functional association of 
at least one T1R receptor and another receptor, typically another T1R receptor 
polypeptide (or, alternatively another non-TIR receptor polypeptide). For clarity, the 

25 functional co-dependence of the TIRs is described in this application as reflecting their 
possible function as heterodimeric taste receptor complexes. However, as discussed 
previously, functional co-dependence may alternatively reflect an indirect interaction. 
For example, T1R3 may function solely to facilitate surface expression of T1R1 and 
T1R2, which may act independently as taste receptors. Alternatively, a functional taste 

30 receptor may be comprised solely of T1R3, which is differentially processed under the ; 
control of T1R1 or T1R2, analogous to RAMP-dependent processing of the calcium- 
related receptor. 
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The phrase "functional effects" in the context of assays for testing compounds 
that modulate T1R family member mediated taste transduction includes the 
determination of any parameter that is indirectly or directly under the influence of the 
receptor, e.g., functional, physical and chemical effects. It includes ligand binding, 
5 changes in ion flux, membrane potential, current flow, transcription, G protein binding, 
GPCR phosphorylation or dephosphorylation, conformation change-based assays, 
signal transduction, receptor-ligand interactions, second messenger concentrations {e.g., 
cAMP, cGMP, IP3, or intracellular Ca 2+ ), in vitro, in vivo, and ex vivo and also includes 
other physiologic effects such increases or decreases of neurotransmitter or hormone 
10 release. 

By "detennining the functional effect" in the context of assays is meant assays 
for a compound that increases or decreases a parameter that is indirectly or directly 
under the influence of a T1R family member, e.g., functional, physical and chemical 
effects. Such functional effects can be measured by any means known to those skilled - 

15 in the art, e.g, changes in spectroscopic characteristics (e.g., fluorescence, absorbency, 
refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility properties, 
patch clamping, voltage-sensitive dyes, whole cell currents, radioisotope efflux, 
inducible markers, oocyte T1R gene expression; tissue culture cell T1R expression; 
transcriptional activation of T1R genes; ligand-binding assays; voltage, membrane 

20 potential and conductance changes; ion flux assays; changes in intracellular second 
messengers such as cAMP, cGMP, and inositol triphosphate (IP3); changes in 
intracellular calcium levels; neurotransmitter release, conformational assays and the 
like. 

A "flavor or tastant" herein refers to a compound or biologically acceptable salt 
25 thereof that induces, in a subject, the perception of smell and/or taste, which include 
sweet, sour, salty, bitter and umami, and others. The subject can be human, animals, 
and/or a biological assay, such as the ones described and cited in this application. 

A "flavor or taste modifier" herein refers to a compound or biologically 
acceptable salt thereof that modulates, including enhancing or potentiating, inhibiting, 
30 and inducing, the smell and/or tastes of a natural or synthetic tastants in a subject. 

A "flavor or taste enhancer" herein refers to a compound or biologically 
acceptable salt thereof that enhances the tastes or smell of a natural or synthetic 
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I 

tastants, eg., monosodium glutamate (MSG) for umami taste and fructose for sweet 
taste. 

"Umami tastant" or "umami compound*' herein refers to a compound or 
biologically acceptable salt thereof that elicits a detectable umami taste in a subject, 
5 eg., MSG. 

"Sweet tastanf 1 or "sweet compound" herein refers to a compound or 
biologically acceptable salt thereof that elicits a detectable sweet taste in a subject, 
e.g., fructose. 

An "umami taste modifier" herein refers to a compound or biologically 
10 acceptable salt thereof that modulates, including enhancing or potentiating, inhibiting, 
and inducing, the umami taste of a natural or synthetic umami tastants, e.g., 
monosodium glutamate (MSG) in a subject. 

A "sweet taste modifier** herein refers to a compound or biologically acceptable 
salt thereof that modulates, including enhancing or potentiating, inhibiting, and 
15 inducing, the sweet taste of a natural or synthetic sweet tastants, eg., fructose, in a 
subject. 

A "taste enhancing amount" herein refers to an amount of a compound that is 
sufficient to enhance the taste of a natural or synthetic tastants, eg., monosodium 
glutamate (MSG) for umami taste or fructose for sweet taste. 

20 "Wet Soup Category" means wet/liquid soups regardless of concentration or 

container, including frozen Soups. For the purpose of this definition soup(s) means a 
food prepared from meat, poultry, fish, vegetables, grains, fruit and other ingredients, 
cooked in a liquid which may include visible pieces of some or all of these ingredients. 
It may be clear (as a broth) or thick (as a chowder), smooth, pureed or chunky, ready- 

25 to-serve, semi-condensed or condensed and may be served hot or cold, as a first course 
or as the main course of a meal or as a between meal snack (sipped like a beverage). 
Soup may be used as an ingredient for preparing other meal components and may range 
. from broths (consomm6) to sauces (cream or cheese-based soups). 

c< Dehydrated and Culinary Food Category** means: (i) Cooking aid products 

30 such as: powders, granules, pastes, concentrated liquid products, including concentrated 
bouillon, bouillon and bouillon like products in pressed cubes, tablets or powder or 
granulated foim, which are sold separately as a finished product or as an ingredient 
within a product, sauces and recipe mixes (regardless of technology); (ii) Meal 
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solutions products such as: dehydrated and freeze dried soups, including dehydrated 
soup mixes, dehydrated instant soups, dehydrated ready-to-cook soups, dehydrated or 
ambient preparations of ready-made dishes, meals and single serve entries including 
pasta, potato and rice dishes; and (iii) Meal embellishment products such as: 
5 condiments, marinades, salad dressings, salad toppings, dips, breading, batter mixes, 
shelf stable spreads, barbecue sauces, liquid recipe mixes, concentrates, sauces or sauce 
mixes, including recipe mixes for salad, sold as a finished product or as an ingredient 
within a product, whether dehydrated, liquid or frozen. 

"Beverage Category" means beverages, beverage mixes and concentrates, 
10 including but not limited to, alcoholic and non-alcoholic ready to drink and dry 

powdered Other examples of foods and beverages wherein compounds according 
to the invention may be incorporated included by way of example carbonated and non- 
carbonated beverages, e.g., sodas, juices, alcoholic and non-alcoholic beverages, 
confectionary products, e.g., cakes, cookies, pies, candies, chewing gums, gelatins, ice 
1 5 creams, sorbets, puddings, jams, jellies, salad dressings, and other condiments, cereal, 
and other breakfast foods, canned fruits and fruit sauces and the like. 

Additionally, the subject compounds can be used in flavor preparations to be 
added to foods and beverages. In preferred instances the composition will comprise 
another flavor or taste modifier such as a sweet tastant 
20 In some instances biologically acceptable salts of the subject compounds may 

be used. Examples of such salts include alkali and earth metal salts, organic salts, and 
the like. Specific examples include potassium, sodium, calcium and magnesium salts, 
hydrochloric or sulfuric acid salts, ethanolamine salts, and the like. The salt will be 
selected such that it is biologically safe for ingestion and does adversely affect the 
25 sweet taste modulatory properties of the compound. 

As used herein, the term "medicinal product" includes both solids and liquids 
which are ingestible non-toxic materials which have medicinal value such as cough 
syrups, cough drops, aspirin and chewable medicinal tablets. An oral hygiene product 
includes solids and liquids such as toothpaste or mouthwash. 
30 A "comestibly or medicinally acceptable carrier or excipient" is a medium that 

is used to prepare a desired dosage form of the inventive compound. A comestibly or 
medicinally acceptable carrier includes solvents, diluents, or other liquid vehicle; 
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dispersion or suspension aids; surface active agents; isotonic agents; thickening or 
emulsifying agents, preservatives; solid binders; lubricants and the like. 

"Inhibitors," "activators," "enhancers" and "modulators" of T1R genes or 
proteins are used to refer to inhibitory, activating, enhancing or modulating molecules 
5 identified using in vitro and in vivo assays for taste transduction, e.g. 9 ligands, agonists, 
antagonists, and their homologs and mimetics. 

Inhibitors are compounds that, e.g. , bind to, partially or totally block 
stimulation, decrease, prevent, delay activation, inactivate, desensitize, or down 
regulate taste transduction, e.g., antagonists. Activators and enhancers are compounds 

10 that, e.g., bind to, enhance, stimulate, increase, open, activate, facilitate, enhance 
activation, sensitize, or up regulate taste transduction, e.g, agonists. Modulators 
include compounds that, e.g. , alter the interaction of a receptor with: extracellular 
proteins that bind activators or inhibitor (e.g., ebnerin and other members of the 
hydrophobic carrier family); G proteins; kinases (e.g., homologs of rhodopsin kinase 

15 and beta adrenergic receptor kinases that are involved in deactivation and 

desensitization of a receptor); and arrestins, which also deactivate and desensitize 
receptors. Modulators can include genetically modified versions of T1R family 
members, e.g., with altered activity, as well as naturally occurring and synthetic 
ligands, antagonists, agonists, small chemical molecules and the like. Such assays for 

20 inhibitors and activators include, e.g., expressing T1R family members in cells or cell 
membranes, applying putative modulator compounds, in the presence or absence of 
tastants, e.g., sweet tastants, and then determining the functional effects on taste 
transduction, as described above. Samples or assays comprising T1R family members 
that are treated with a potential enhancer, activator, inhibitor, or modulator are 

25 compared to control samples without the inhibitor, activator, or modulator to examine 
the extent of modulation. Positive control samples (e.g. a sweet tastant without added 
modulators) are assigned a relative T1R activity value of 1 00%. 

"EC50" is defined as the amount of a compound that elicits 50% of the maximal 
response the compound can elicit, whether as an activator, enhancer, or modulator. A 

30 dose-dependent response curve was determined for a compound, and the compound 
concentration corresponding to 50% of the maximal response was derived from the 
curve, in one example. 
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"IC50" is defined as the amount of a compound that elicits 50% of the maximal 
effect the compound can elicit as an inhibitor. 

Regarding sweet tastants and enhancers, after a compound is identified, scores 
of their activities are given as percentage of the maximum fructose intensity (%). In 
5 compound dose response, an EC50 can be calculated to reflect the potency of the 
compound as a sweet agonist. In the present invention, an EC50 of lower than about 
100 mM is indicative of compounds that induce T1R2/T1R3 activity as a sweet agonist. 
Preferably, a positive hit for a sweet agonist has an EC50 value of less than about 1 mM; 
more preferably less than about a 10 joM. 
10 In sweet enhancement assay experiments, a fructose dose response was run and 

a second fructose dose response was run with a certain amount of candidate compound 
at every fructose concentrations at the same time. Then, the EC 50 ratio can be 
calculated based on the following definitions: 

EC50 Ratio » EC50 (fructose)/EC 5 o (fructose + [Compound]) 
15 » wherein "[compound]" refers to the concentration of compound used to elicit 

(or enhance or potentiate) the fructose dose response. Those concentrations could vary 
from a pM to an mM, more preferred, from a low nM to \iM. A potent sweet enhancer 
would have a high EC50 Ratio at a low concentration of the compound used. 

In the present invention, an EC50 ratio of greater than 1 is indicative of a 
20 compound that modulates (potentiates) T1R2/T1R3 activity and is an sweet enhancer. 
Preferably, a positive hit will have EC50 ratio values of at least 1.20, preferably ranging 
from at least 1 .50 to 100 or even higher. 

By contrast, competing agonists (those sweet tastants that bind mutually 
exclusively) or inhibitors always yield values of EC50 ratio less than 1, such as from 0- 
25 1. 

Regarding umami tastants and enhancers, scores of their activities can be given 
as percentage of the maximum MSG intensity (%). In compound dose response, an 
EC50 can be calculated to reflect the potency of the compound as umami agonist. In the 
present invention, an EC50 of lower than about 10 mM is indicative of compounds that 
30 induce T1R1/T1R3 activity and an umami agonist. Preferably, a positive hit for an 

umami agonist will have EC50 values of less than about 1 mM; more preferably ranging 
from about a pM to about a low ^M. 
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In enhancement assay experiments, a MSG dose response was run and a second 
MSG dose response was run with a certain amount of candidate compound at every 
MSG concentrations at the same time. Then, the EC50 ratio is calculated based on the 
following definitions: 
5 EC50 Ratio = EC 50 (MSG)/EC 5 o (MSG + [Compound]) 

wherein "[compound]" refers to the concentration of compound used to elicit 
(or enhance or potentiate) the MSG dose response. Those concentrations can vary 
from a pM to an mM, more preferred, from a low nM to nM. A potent umami 
enhancer has a high EC50 Ratio at a low concentration of the compound used. 

10 In the present invention, an EC 50 ratio of greater than 1 is indicative of a 

compound that modulates (potentiates) T1R1/T1R3 activity and in an umami enhancer. 
Preferably, a positive hit has EC 50 ratio values of at least 1 .20, preferably ranging from 
at least 1 .50 to 1 00 or even higher. 

Negative control samples (e.g. buffer without an added taste stimulus) are 

15 assigned a relative T1R activity value of 0%. Inhibition of a T1R is achieved when a 
mixture of the positive control sample and a modulator result in the T1R activity value 
relative to the positive control is about 80%, optionally 50% or 25-0%. Activation of a 
T1R by a modulator alone is achieved when the T1R activity value relative to the 
positive control sample is 10%, 25%, 50%, 75%, optionally 100%, optionally 150%, 

20 optionally 200-500%, or 1000-3000% higher. 

The terms purified," "substantially purified," and isolated" as used herein 
refer to the state of being free of other, dissimilar compounds with which the compound 
of the invention is normally associated in its natural state, so that the '^purified," 
"substantially purified," and "isolated" subject comprises at least 0.5%, 1%, 5%, 10%, 

25 or 20%, and most preferably at least 50% or 75% of the mass, by weight, of a given 
sample. In one preferred embodiment, these terms refer to the compound of the 
invention comprising at least 95% of the mass, by weight, of a given sample. As used 
herein, the terms "purified," "substantially purified," and "isolated," when referring to a 
nucleic acid or protein, also refers to a state of purification or concentration different 

30 than that which occurs naturally in the mammalian, especially human body. Any 
degree of purification or concentration greater than that which occurs naturally in the 
mammalian, especially human, body, including (1) the purification from other 
associated structures or compounds or (2) the association with structures or compounds 
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to which it is not normally associated in the mammalian, especially human, body, are 
within the meaning of "isolated/* The nucleic acid or protein or classes of nucleic acids 
or proteins, described herein, may be isolated, or otherwise associated with structures 
or compounds to which they are not normally associated in nature, according to a 
5 variety of methods and processes known to those of skill in the art. 

The term '^nucleic acid" or "nucleic acid sequence" refers to a deoxy- 
ribonucleotide or ribonucleotide oligonucleotide in either single- or double-stranded 
form. The term encompasses nucleic acids, i.e., oligonucleotides, containing known 
analogs of natural nucleotides. The term also encompasses nucleic-acid-like structures 
1 0 with synthetic backbones (see e.g. , Oligonucleotides and Analogues, a Practical 

Approach, ed. F. Eckstein, Oxford Univ. Press (1991); Antisense Strategies, Annals of 
the N. Y. Academy of Sciences, Vol. 600, Eds. Baserga et al (NY AS 1992); Milligan X 
Med. Chem. 36:1923-1937 (1993); Antisense Research and Applications (1993, CRC 
Press), WO 97/0321 1; WO 96/39154; Mata, Toxicol Appl Pharmacol 144:189-197 • 
1 5 (1997); Strauss-Soukup, Biochemistry 36:8692-8698 (1997); Samstag, Antisense 
Nucleic Acid Drug Dev, 6:153-156 (1996)). 

Unless otherwise indicated, a particular nucleic acid sequence also implicitly 
encompasses conservatively modified variants thereof (e.g. , degenerate codon 
substitutions) and complementary sequences, as well as the sequence explicitly 
20 indicated. Specifically, degenerate codon substitutions may be achieved by generating, 
e.g., sequences in which the third position of one or more selected codons is substituted 
with mixed-base and/or deoxyinosine residues (Batzer et al, Nucleic Acid Res., 
19:5081 (1991); Ohtsuka et al,J. Biol Chem., 260:2605-2608 (1985); Rossolini et al, 
Mol Cell Probes, 8:91-98 (1994)). The term nucleic acid is used interchangeably with 
25 gene, cDNA, mRNA, oligonucleotide, and polynucleotide. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an artificial chemical mimetic of 
a corresponding naturally occurring amino acid, as well as to naturally occurring amino 
30 acid polymers and non-naturally occurring amino acid polymer. 

The term "plasma membrane translocation domain" or simply "translocation 
domain" means a polypeptide domain that, when incorporated into a polypeptide 
coding sequence, can with greater efficiency "chaperone" or "translocate" the hybrid 
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("fusion") protein to the cell plasma membrane than without the domain. For instance, 
a 'translocation domain" may be derived from the amino terminus of the bovine 
rhodopsin receptor polypeptide, a 7-transmembrane receptor. However, rhodopsin 
from any mammal may be used, as can other translocation facilitating sequences. Thus, 
5 the translocation domain is particularly efficient in translocating 7-transmembrane 
fusion proteins to the plasma membrane, and a protein {e.g., a taste receptor 
polypeptide) comprising an amino terminal translocating domain will be transported to 
the plasma membrane more efficiently than without the domain. However, if the N- 
tenninal domain of the polypeptide is active in binding, as with the T1R receptors of 

10 the present invention, the use of other translocation domains may be preferred. For 
instance, a PDZ domain-interacting peptide, as described herein, may be used. 

The "translocation domain," "ligand-binding domain", and chimeric receptors 
compositions described herein also include "analogs," or "conservative variants" and 
"mimetics" ("peptidomimetics") with structures and activity that substantially 

15 correspond to the exemplary sequences. Thus, the terms "conservative variant" or 
"analog" or "mimetic" refer to a polypeptide which has a modified amino acid 
sequence, such that the change(s) do not substantially alter the polypeptide's (the 
conservative variant's) structure and/or activity, as defined herein. These include 
conservatively modified variations of an amino acid sequence, ie. 9 amino acid 

20 substitutions, additions or deletions of those residues that are not critical for protein 
activity, or substitution of amino acids with residues having similar properties (e.g., 
acidic, basic, positively or negatively charged, polar or non-polar, etc.) such that the 
substitutions of even critical amino acids does not substantially alter structure and/or 
activity. 

25 More particularly, "conservatively modified variants" applies to both amino 

acid and nucleic acid sequences. With respect to particular nucleic acid sequences, 
conservatively modified variants refers to those nucleic acids which encode identical or 
essentially identical amino acid sequences, or where the nucleic acid does not encode 
an amino acid sequence, to essentially identical sequences. Because of the degeneracy 

30 of the genetic code, a large number of functionally identical nucleic acids encode any 
given protein. 

For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid 
alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
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be altered to any of the corresponding codons described without altering the encoded 
polypeptide. 

Such nucleic acid variations are "silent variations/* which are one species of 
conservatively modified variations. Every nucleic acid sequence herein, which encodes 
5 a polypeptide, also describes every possible silent variation of the nucleic acid. One of 
skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily 
the only codon for methionine, and TGG, which is ordinarily the only codon for 
tryptophan) can be modified to yield a functionally identical molecule. Accordingly, 
each silent variation of a nucleic acid, which encodes a polypeptide, is implicit in each 

1 0 described sequence. 

Conservative substitution tables providing functionally similar amino acids are 
well known in the art. For example, one exemplary guideline to select conservative 
substitutions includes (original residue followed by exemplary substitution): ala/gly or . 
ser; arg/lys; asn/gln or his; asp/gju; cys/ser; gln/asn; gly/asp; gly/ala or pro; his/asn or 

1 5 gin; ile/leu or val; leu/ile or val; lys/arg or gin or glu; met/leu or tyr or ile; phe/met or 
leu or tyr; ser/thr; thr/ser; trp/tyr; tyr/trp or phe; val/ile or leu. An alternative exemplary 
guideline uses the following six groups, each containing amino acids that are 
conservative substitutions for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) 
Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine 

20 (R), Lysine (I); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 6) 

Phenylalanine (F), Tyrosine (Y), Tryptophan (W); (see also, e.g., Creighton, Proteins, 
W.H. Freeman and Company (1984); Schultz and Schimer, Principles of Protein 
Structure, Springer-Vrlag (1979)). One of skill in the art will appreciate that the above- 
identified substitutions are not the only possible conservative substitutions. For 

25 example, for some purposes, one may regard all charged amino acids as conservative 
substitutions for each other whether they are positive or negative. In addition, 
individual substitutions, deletions or additions that alter, add or delete a single amino 
acid or a small percentage of amino acids in an encoded sequence can also be 
considered "conservatively modified variations." 

30 The terms "mimetic" and "peptidomimetic" refer to a synthetic chemical 

compound that has substantially the same structural and/or functional characteristics of 
the polypeptides, e.g., translocation domains, ligand-binding domains, or chimeric 
receptors of the invention. The mimetic can be either entirely composed of synthetic, 
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non-natural analogs of amino acids, or may be a chimeric molecule of partly natural 
peptide amino acids and partly non-natural analogs of amino acids. The mimetic can 
also incorporate any amount of natural amino acid conservative substitutions as long as 
such substitutions also do not substantially alter the mimetic's structure and/or activity. 

5 As with polypeptides of the invention which are conservative variants, routine 

experimentation will determine whether a mimetic is within the scope of the invention, 
i.e., that its structure and/or function is not substantially altered. Polypeptide mimetic 
compositions can contain any combination of non-natural structural components, which 
are typically from three structural groups: a) residue linkage groups other than the 

10 natural amide bond (peptide bond") linkages; b) non-natural residues in place of 
naturally occurring amino acid residues; or c) residues which induce secondary 
structural mimicry, i.e., to induce or stabilize a secondary structure, e.g., a beta turn, 
gamma turn, beta sheet, alpha helix conformation, and the like. A polypeptide can be 
characterized as a mimetic when all or some of its residues are joined by chemical 

15 means other than natural peptide bonds. Individual peptidomimetic residues can be 
joined by peptide bonds, other chemical bonds or coupling means, such as, e.g., 
glutaraldehyde, N-hydroxysuccinimide esters, bifiinctional maleimides, N,N'- 
dicyclohexylcarbodiimide (DCC) or N,N'-diisopropylcarbodiimide (DIC). Linking 
groups that can be an alternative to the traditional amide bond ("peptide bond") 

20 linkages include, e.g., ketomethylene (e.g., -C(=0)-CH 2 - for -C(=0)-NH-), 

aminomethylene (CH 2 -NH), ethylene, olefin (CH=€H), ether (CH 2 -0), thioether (CH 2 - 
S), tetrazole (CN 4 ), thiazole, retroamide, or ester (see, e.g., Spatola, Chemistry and 
Biochemistry of Amino Acids, Peptides and Proteins, Vol. 7, pp 267-357, Peptide 
Backbone Modifications," Marcell Dekker, NY (1983)). A polypeptide can also be 

25 characterized as a mimetic by containing all or some non-natural residues in place of 
naturally occurring amino acid residues; non-natural residues are well described in the 
scientific and patent literature. 

A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, or chemical means. For example, useful 

30 labels include 32 P, fluorescent dyes, electron-dense reagents, enzymes (e.g. , as 

commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins which can 
be made detectable, e.g., by incorporating a radiolabel into the peptide or used to detect 
antibodies specifically reactive with the peptide. 



84 



WO 2005/015158 



PCT/US2004/025459 



A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van 
der Waals, electrostatic, or hydrogen bonds to a label such that the presence of the 
probe may be detected by detecting the presence of the label bound to the probe. 

5 As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic 

acid capable of binding to a target nucleic acid of complementary sequence through one 
or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, 
G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in 

10 a probe may be joined by a linkage other than a phosphodiester bond, so long as it does 
not interfere with hybridization. Thus, for example, probes may be peptide nucleic ( 
acids in which the constituent bases are joined by peptide bonds rather than 
phosphodiester linkages. It will be understood by one of skill in the art that probes may 
bind target sequences lacking complete complementarity with the probe sequence 

1 5 depending upon the stringency of the hybridization conditions. The probes are 

optionally directly labeled as with isotopes, chromophores, lumiphores, chromogens, or 
indirectly labeled such as with biotin to which a streptavidin complex may later bind. 
By assaying for the presence or absence of the probe, one can detect the presence or 
absence of the select sequence or subsequence. 

20 The term "heterologous" when used with reference to portions of a nucleic acid 

indicates that the nucleic acid comprises two or more subsequences that are not found 
in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 

25 coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship 
to each other in nature (e.g., a fusion protein). 

A "promoter" is defined as an array of nucleic acid sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

30 acid sequences near the start site of transcription, such as, in the case of a polymerase II 
type promoter, a TATA element. A promoter also optionally includes distal enhancer 
or repressor elements, which can be located as much as several thousand base pairs 
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from the start site of transcription. A "constitutive"promoter is a prolnoter that is active 
under most environmental and developmental conditions. 

An "inducible" promoter is a promoter that is active under environmental or 
developmental regulation. The tenn "operably linked" refers to a functional linkage 
5 between a nucleic acid expression control sequence (such as a promoter, or array of 
transcription factor binding sites) and a second nucleic acid sequence, wherein the 
expression control sequence directs transcription of the nucleic acid corresponding to 
the second sequence. 

As used herein, Recombinant" refers to a polynucleotide synthesized or 
1 0 otherwise manipulated in vitro (e.g. , "recombinant polynucleotide"), to methods of 
using recombinant polynucleotides to produce gene products in cells or other biological 
systems, or to a polypeptide (Recombinant protein") encoded by a recombinant 
polynucleotide. "Recombinant means" also encompass the ligation of nucleic acids 
having various coding regions or domains or promoter sequences from different 
15 sources into an expression cassette or vector for expression of, e.g., inducible or 
constitutive expression of a fusion protein comprising a translocation domain of the 
invention and a nucleic acid sequence amplified using a primer of the invention. 

As used herein, a "stable cell line" refers to a cell line, which stably, i.e. over a 
prolonged period, expresses a heterologous nucleic sequence, i.e. a T1R or G protein. 
20 In preferred embodiments, such stable cell lines will be produced by transfecting 
appropriate cells, typically mammalian cells, e.g. HEK-293 cells, with a linearized 
vector that contains a T1R expression construct, i.e. T1R1, T1R2 and/or T1R3. Most 
preferably, such stable cell lines will be produced by co-transfecting two linearized 
plasmids that express hTlRl and hTlR3 or hTlR2 and hTlR3 and an appropriate 
25 selection procedure to generate cell lines having these genes stably integrated therein. 
Most preferably, the cell line will also stably express a G protein such as G a \s. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture 
3 0 (e.g. , total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture 
of nucleic acid, but to no other sequences. Stringent conditions are sequence dependent 
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and will be different in different circumstances. Longer sequences hybridize 
specifically at higher temperatures. An extensive guide to the hybridization of nucleic 
acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology - 
Hybridization with Nucleic Probes, "Overview of principles of hybridization and the 
5 strategy of nucleic acid assays" (1993). Generally, stringent conditions are selected to 
be about 5-10° C lower than the thermal melting point (Tm) for the specific sequence at 
a defined ionic strength pH. The Tm is the temperature (under defined ionic strength, 
pH, and nucleic concentration) at which 50% of the probes complementary to the target 
hybridize to the target sequence at equilibrium (as the target sequences are present in 

10 excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions 
will be those in which the salt concentration is less than about 1.0 M sodium ion, 
typically about 0.01 to 1 .0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 
and the temperature is at least about 30° C for short probes (e.g 9 10 to 50 nucleotides) 
and at least about 60° C for long probes (e.g., greater than 50 nucleotides). Stringent 

1 5 conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two 
times background, optionally 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, 

20 and 0. 1 % SDS at 65°C. Such hybridizations and wash steps can be carried out for, e.g., 
1, 2, 5, 10, 15, 30, 60; or more minutes. 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially related if the polypeptides that they encode are substantially related. 
This occurs, for example, when a copy of a nucleic acid is created using the maximum 

25 codon degeneracy permitted by the genetic code. In such cases, the nucleic acids 
typically hybridize under moderately stringent hybridization conditions. Exemplary 
"moderately stringent hybridization conditions" include a hybridization in a buffer of 
40% formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in IX SSC at 45°C. Such 
hybridizations and wash steps can be carried out for, e.g., 1, 2, 5, 10, 15, 30, 60, or 

30 more minutes. A positive hybridization is at least twice background. Those of 

ordinary skill will readily recognize that alternative hybridization and wash conditions 
can be utilized to provide conditions of similar stringency. 
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"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an 
antigen. The recognized immunoglobulin genes include the kappa, lambda, alpha, 
gamma, delta, epsilon, and mu constant region genes, as well as the myriad 
5 immunoglobulin variable region genes. Light chains are classified as either kappa or 
lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in 
turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. 
Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
10 having one "light" (about 25 kDa) and one "heavy" chain (about 50-70 kDa). The N- 
terminus of each chain defines a variable region of about 100 to 1 10 or more amino 
acids primarily responsible for antigen recognition. The terms " variable light chain" 
(VL) and "variable heavy chain" (VH) refer to these light and heavy chains 
respectively. 

15 A "chimeric antibody" is an antibody molecule in which (a) the constant region, 

or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector 
function and/or species, or an entirely different molecule which confers new properties 
to the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or 
20 (b) the variable region, or a portion thereof, is altered, replaced or exchanged with a 
variable region having a different or altered antigen specificity. 

An "anti-TIR" antibody is an antibody or antibody fragment that specifically 
binds a polypeptide encoded by a T1R gene, cDNA, or a subsequence thereof. 

The term "immunoassay" is an assay that uses an antibody to specifically bind 
25 an antigen. The immunoassay is characterized by the use of specific binding properties 
of a particular antibody to isolate, target, and/or quantify the antigen. 

The phrase "specifically (or selectively) binds" or "specifically (or selectively) 
reacts with," when referring to a molecule or composition, refers to a binding reaction 
that is determinative of the presence of the molecule in a heterogeneous population of 
30 other biologies. Thus, under designated conditions, the specified molecules bind to a 
particular receptor at least two times the background and do not substantially bind in a 
significant amount to other molecules present in the sample. Specific binding to a 
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receptor under such conditions may require a receptor that is selected for its specificity 

for a particular molecule. 

Regarding antibodies, a variety of immunoassay formats may be used to select 

antibodies specifically immunoreactive with a particular protein. For example, solid- 
5 phase ELIS A immunoassays are routinely used to select antibodies specifically 

immunoreactive with a protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory 

Manual, (1988), for a description of immunoassay formats and conditions that can be 
. used to determine specific immunoreactivity). Typically a specific or selective reaction 

will be at least twice background signal or noise and more typically more than 10 to 
10 100 times background. 

The phrase "selectively associates with" refers to the ability of a nucleic acid to 

"selectively hybridize" with another as defined above, or the ability of an antibody to 

"selectively (or specifically) bind to a protein, as defined above. 

The term "expression vector" refers to any recombinant expression system for 
1 5 the purpose of expressing a nucleic acid sequence of the invention in vitro or in vivo, 

constitutively or inducibly, in any cell, including prokaryotic, yeast, fungal, plant, 

insect or mammalian cell. The term includes linear or circular expression systems. The- 

term includes expression systems that remain episomal or integrate into the host cell 

genome. The expression systems can have the ability to self-replicate or not, i.e., drive 
20 only transient expression in a cell. The term includes recombinant expression 

"cassettes which contain only the minimum elements needed for transcription of the 

recombinant nucleic acid. 

By "host cell" is meant a cell that contains an expression vector and supports 

the replication or expression of the expression vector. Host cells may be prokaryotic 
25 cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, worm or 

mammalian cells such as CHO, Hela, HEK-293, and the like, e.g., cultured cells, 

explants, and cells in vivo. 

Compounds 

As discussed above, there are different domains on the T1R receptors. T1R1, 
30 T1R2, and T1R3 each contain an N-terminal extracellular domain (also known as the 
Venus flytrap domain), transmembrane domains comprising seven transmembrane 
regions, and corresponding cytoplasmic, and extracellular loops; a cysteine rich 
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domain, and a C-teirainal domain. Each region defines a specific set of compounds that 
bind specifically to that region. 

In humans, the N-terminal extracellular domain comprises amino acids 1 to 560 
of hTlR2 and amino acids 1 to 563 of hTlR3. In rats, the N-terminal extracellular 

5 domain comprises amino acids 1 to 564 of rTlR2, and amino acids 1 to 568 of rTlR3. 
In humans, the C-terminal transmembrane domain and intracellular domain 
comprise amino acids 561 to 839 of hTlR2, and amino acids 564 to 852 of hTlR3. In 
rats, the C-terminal transmembrane domain and intracellular domain comprise amino 
acids 565 to 842 of iT1R2, and amino acids 569 to 858 of rTlR3. 

1 0 Metabotropic glutamate receptors (mGluR) are another class of C-class G 

protein-coupled receptors that respond to glutamate. These are found primarily m the 
brain and neuronal tissue where they play a role in neuronal signaling. The mGluR N- 
terminal extracellular domain can be covalently linked to a T1R in order to create 
. chimeric receptors. The mGluR receptor can be any of mGluRl-mGluR8, for example- 

1 5 Different ligands bind to different domains on different subunits of both the umami and 
the sweet receptors. For example, aspartame and neotame bind to the N-terminal 
extracellular domain of T1R2, while cyclamate, neohesperidin dihydrochalcone 
(NHDC), and lactisole bind to the transmembrane domain of T1R3. Because T1R3 is 
one of the two subunits in the T1R1/T1R3 umami taste receptor, cyclamate, NHDC and 

20 lactisole can interact with T1R3 in the T1R1/T1R3 umami taste receptor as well. 

Cyclamate and NHDC enhance the activity of the umami taste receptor, while lactisole* 
inhibits the umami receptor. 

The specific binding compounds of the invention as it relates to umami tastants 
comprise amides. The amide compounds also comprise certain sub-classes of amide 

25 derivatives or classes of derivatives related to amides, such as for example ureas, 
urethanes, oxalamides, acrylamides, and the like. 

Molecules that interact with the transmembrane domain of T1R2, for example, 
can be modulators of sweet taste, and molecules that interact with the transmembrane 
domain of T1R3 can be modulators of sweet taste and/or umami taste. 

30 Human T1R2/T1R3 recognizes a group of sweeteners which are not recognized by rat 
T1R2/T1R3, and human but not rat T1R2/T1R3 is inhibited by lactisole. When the 
extracellular domain of human T1R2 was replaced by its rat counterpart, the human 
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receptor lost the ability to recognize aspartame, indicating that this part of human T1R2 
is required for binding to aspartame. Inversely, when the extracellular domain of rat 
T1R2 was replaced by its human counterpart, the rat receptor acquired the ability to 
recognize aspartame, indicating that this part of the human T1R2 is sufficient to bind 
5 aspartame. By the same principle, the transmembrane domain of human T1R3 was 
required and sufficient for 

Table 6 shows the abbreviations used to represent various rat/human chimeric 
receptors and receptor subunits. 
TABLE 6 

hT!R2- human T1R2 J 

hT!R3- human T1R3 , : 

rT!R2-ratTlR2 . 

rT!R3-ratT!R3 

hTlR2/iTlR3 - a receptor composed of human T1R2 and rat T1R3 

rTlR2/hTlR3 - a receptor composed of a rat T1R2 and human T1R3 

hTlR2/h3-r3 - a receptor composed of human T1R2 and a chimeric T1R3 with human N- 
terminal extracellular domain and rat transmembrane and C-terminal domain 

rTlR2/r3-h3 - a receptor composed of rat T1R2 and a chimeric T1R3 with rat N-terminal 
extracellular domain and human transmembrane and C-terminal domain 

h2-r2/rTlR3 - a receptor composed of a chimeric T1R2 with human N-terminal extracellular 
domain and rat transmembrane and C-terminal domain and rat T1R3 

r2-h2/rTlR3 - a receptor composed of a chimeric T1R2 with rat N-teiminal extracellular 
domain and human transmembrane and C-terminal domain and rat T1R3 

h2-hl/hTlR3 - a receptor composed of a chimeric T1R with human T1R2 N-terminal 
extracellular domain and human T1R1 transmembrane and C-tenninal domain and human 
T1R3 

hl-h2/hTlR3 - a receptor composed of a chimeric T1R with human T1R1 N-terminal 
extracellular domain and human T1R2 transmembrane and C-terminal domain and human 
T1R3 

h2-mGluRl/h3-mGluRl - a receptor composed of a N-terminal extracellular domain from 
hTlR2 covalently linked to the transmembrane and C-terminal domain of mGluRl and a 
N-terminal extracellular domain from hTlR3 covalently linked to the transmembrane and 
C-terminal domain of mGluRl 

hl-mGlulR/h3-mGluRl - a receptor composed of a N-tenninal extracellular domain from 
hTlRl covalently linked to the transmembrane and C-terminal domain of mGluRland a 
N-terminal extracellular domain from hT!R3 covalently linked to the transmembrane and 
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C-terminal domain of mGluRl 

mGluRl -h2/mGluRl -h3 - a receptor composed of a N-terminal extracellular domain from 
mGluRl covalently linked to the transmembrane and C-terminal domain of hTlR2 and a 
N-terminal extracellular domain from a mGluRl covalently linked to the transmembrane 
and C-terminal domain of hTlR3 



mGluRl -hl/mGluRl -h3 - receptor composed of a N-terminal extracellular domain from 
mGluRl covalently linked to the transmembrane and C-terminal domain of hTlRl and a 
N-terminal extracellular domain from mGluRl covalently linked to the transmembrane and 
C-terminal domain of hTlR3 



Disclosed herein are non-naturally occurring compounds that specifically bind 
to the T1R2/T1R3 receptor comprising hTlR2/hTlR3 but not rTlR2/rTlR3. Examples 
of such compounds include, but are not limited to neotame, aspartame, cyclamate, 
5 lactisol, Compound 883360, Compound 6542888, Compound 403249, Compound 
6364395, Dihydroxybenzoic acid (DHB), Compound 6542888, and neohesperidine 
dihydrochalcone (NHDC) Additional examples are found in Tables 1-4. The organic, 
non-peptide compounds can be approximately the size of a box of dimensions 15x8x8 
angstroms, more preferably the dimension should be 12x5x5 angstroms. 
10 Also disclosed are compounds that specifically bind to a T1R2/T1R3 receptor 

comprising hTlR2/rTlR3 but not rTlR2/hTlR3. Examples of such compounds 
include, but are not limited to aspartame, and neotame. Additional examples are found 
in Table 5. 

Also disclosed are compounds that specifically bind to the N-terminal 
15 extracellular domain of T1R2 of the hTlR2/hTlR3 receptor. Examples of such 

compounds include, but are not limited to neotame, aspartame carbohydrate sugars (e.g. 
sucrose, fructose, glucose, tagatose, erythritol, sorbitol, maltose, xylitol, lactose and 
galactose, as well as all other carbohydrate sugars). Additional examples are found in 
Tables. 

20 Also disclosed are compounds that specifically bind to the Venus Flytrap 

Domain (VFD) of T1R2 of the hTlR2/hTlR3 and hTlR2/rTlR3 receptor. 

Also disclosed are compounds that specifically bind to the N-terminal Venus 
flytrap domain of the T1R2 subunit of the T1R2/T1R3 receptor. More specifically, also 
disclosed are compounds that specifically bind to amino acid residues 144 and 302 of 

25 the human N-terminal Venus flytrap domain oftheTlR2 subunit of theTlR2/TlR3 
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receptor. Examples of such compounds include, but are not limited to aspartame, 
neotame, carbohydrates, and sweet amino acids, such as D-Trp, Ala, and Gly. 

Also disclosed are compounds that specifically bind to the cysteine-ricb region 
of T1R2 of the hTlR2/hTlR3 receptor. Also disclosed are compounds that specifically 

5 bind to the Transmembrane Domain (TM) of T1R2 of the hTlR2/hTlR3 receptor. 

Also disclosed are compounds that specifically bind to a T1R2/T1R3 receptor 
comprising rTlR2/hTlR3 but not hTlR2/rTlR3. Examples of such compounds 
include, but are not limited to cyclamate, NHDC, lactisole, Compound 883360, 
Compound 403249, and Compound 6364395. Additional examples are found in Table 

0 5. 

Also disclosed are compounds that specifically bind to hTlR2/hTlR3 and 
rTlR2/r3-h3 but not to rTlR2/rTlR3 or to hTlR2/h3-r3. Examples of such compounds 
include, but are not limited to cyclamate, NHDC, lactisole, Compound 883360, 
Compound 403249 and Compound 6364395. 
5 Also disclosed are compounds that specifically bind to extracellular loop 2 and 

extracellular loop 3 of the human C-terminal domain of the T1R3 subunit of 
theTlR2/TlR3 receptor. Also disclosed are compounds that specifically bind to 
hTlR2/hTlR3 and r2-h2/iTlR3 but not to rTlR2/rTlR3 or to h2-r2/hTlR3. 

Also disclosed are compounds that specifically bind to the human N-terminal 
0 extracellular domain of the T1R3 subunit of the T1R2/T1R3 receptor. Also disclosed 
, are compounds that specifically bind to the Venus Flytrap Domain (VFD) of T1R3 of 
the hTlR2/hTlR3 receptor. Examples of such compounds include, but are not limited 
to aspartame, neotame, carbohydrates, and sweet amino acids, such as D-Trp, Ala, and 
Gly. 

5 Also disclosed are compounds that specifically bind to the Transmembrane 

Domain of T1R3 of the hTlR2/hTlR3 receptor. Also disclosed are compounds that 
specifically bind to extracellular loop 2 and extracellular loop 3 of the human 
transmembrane domain of the T1R3 subunit of T1R2/T1R3. Examples of such 
compounds include, but are not limited to cyclamate. 

0 The compound of the invention does not include sucrose, fructose, glucose, 

erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain known natural terpenoids, 
flavonoids, or protein sweeteners, di-peptides, tri-peptides, aspartame, saccharin, 
sucralose, halogenated saccharides, acesulfame-K, cyclamate, sucralose, and alitame. 
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neotame, perillartine, SC-45647, SC-40014, monellin, NC-002740-01, thaumatin, CC- 
00100, NC-00420, alitame, SC-44102, dulcin, NC-00576, slycyrrhizic Acid, stevioside, 
Na-Saccharin, D-tryptophan, cyclamate, DHB, glycolic Acid, glycine, D (-)fructose, 
homofuronol, D (-) tagatose, maltose, D (+) glucose, D-sorbitol, D (+) galactose, a- 
5 lactose, LOfructose, L (+) Compound 403249, and glucose. 

Optionally, a compound of the invention is also not Compound 6364395. 
Also disclosed herein are compounds that bind a truncated region of a T1R 
domain. For example, disclosed are compounds that specifically bind to the TM domain 
of T1R2 of a truncated sweet receptor comprising h2TM/h3TM, compounds that 

1 0 specifically bind to the TM domain of T1R3 of a truncated sweet receptor comprising 
h2TM/h3TM, compounds that specifically bind to the TM domain of T1R2 of a 
chimeric receptor comprising mGluR-h2/mGluR-h3, compounds that specifically bind 
to the TM domain of T1R3 of a chimeric receptor comprising mGluR-h2/mGluR-h3, 
compounds that binds to the TM domain of T1R1 of a truncated savory receptor 

15 comprising hlTM/h3TM, compound that binds to the TM domain of T1R3 of a 
truncated sweet receptor comprising hlTM/h3TM, compounds that bind to the TM 
domain of T1R1 of a chimeric receptor comprising mGluR-hl/mGluR-h3, and 
compounds that bind to the TM domain of T1R3 of a chimeric receptor comprising 
mGluR-hl/mGluR-h3. SEQ ID NOS: 29-33 represent these truncated receptors. 

20 The compounds of the invention do not include monosodium glutamate 

("MSG"), inosine monophosphate (IMP) or guanosine monophosphate (GMP), sucrose, 
fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain known 
natural terpenoids, flavonoids, or protein sweeteners, di-peptides, tri-peptides 
aspartame, saccharin, sucralose, halogenated saccharides, acesulfame-K, cyclamate, 

25 sucralose, alitame, monosodium glutamate ('MSG"), inosine monophosphate (IMP) or 
guanosine monophosphate (GMP), or adenosine monophospate. 

Compound 403249 is (5-(4H-benzo[d][l,3]oxathiin-2-yl)-2-methyoxyphenol, 
while Compound 6364395 is 3-(3-hydroxy-4-methoxyphenethyl)benzo[d]isoxyazole- 
4,6-diol. 

30 The compounds described above can demonstrate a compound-dependent 

increase in fluorescence with an activity compared to the maximal activity for fructose 
of at least 25% in a fluorescence-based assay using a FLIPR instrument (Fluorometric 
Intensity Plate Reader, Molecular Devices, Sunnyvale, CA). For examples of this 
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protocol, see Examples 12 and 18. The compounds can also demonstrate a compound- 
dependent decrease in the EC50 for a sweetener by at least two-fold in a fluorescence- 
based assay using a FL1PR (Molecular Devices) instrument Furthermore, in a cell- 
based assay, the compound can result in at least 10 out of 100 cells transfected with 

5 wild-type or chimeric receptor showing a compound-dependent increase in 

fluorescence. An example of a cell-based assay can be found in Example 24. The 
compound can also demonstrate a compound-dependent increase of at least 1.1,1.2, 
1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, two-fold or greater, or any number in between, in the 
number of fluorescent cells in response to a sub-maximal level of a sweetener. The 

10 response can be measured by fluorescence, calcium levels, IP3 levels, cAMP levels, 
GTPyS binding, or reporter gene activity (e.g. luciferase, beta-galactosidase). 

Furthermore, the compounds disclosed herein can have one or more of the 
following characteristics in a cell: a decreased EC50 compared to a control of at least 
approximately 50%, increased intracellular Ca 2+ level by at least approximately 25%, 

1 5 increased intracellular cAMP by at least approximately 25%, increased intracellular 
cGMP by at least approximately 25%, increased intracellular IP3 by at least 
approximately 25%, or increased G protein binding of GTPyS by at least approximately 
25%. 

20 Methods of Using the Compounds 

Also disclosed are methods modulating the savory taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 
or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a savory flavor modulating amount of at least one non-naturally 

25 occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby modulating the savory 
taste of a comestible or medicinal product. 

Also disclosed are methods for inhibiting the savory taste of a comestible ot 
medicinal product comprising providing at least one comestible or medicinal product, 

30 or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a savory flavor inhibiting amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
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to form a modified comestible or medicinal product; thereby inhibiting the savory taste 
of a comestible or medicinal product 

Also disclosed are methods for increasing the savory taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 
5 or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a savory flavor increasing amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby increasing the savory taste 
of a comestible or medicinal product. 

10 

Also disclosed are methods for modulating the sweet taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 
or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a sweet flavor modulating amount of at least one non-naturally 

15 occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby modulating the sweet taste 
of a comestible or medicinal product. 

Also disclosed are methods for inhibiting the sweet taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 

20 or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a sweet flavor inhibiting amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby inhibiting the sweet taste 
of a comestible or medicinal product. 

25 Also disclosed are methods for increasing the sweet taste of a comestible or 

medicinal product comprising providing at least one comestible or medicinal product, 
or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a sweet flavor increasing amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 

30 to form a modified comestible or medicinal product; thereby increasing the sweet taste 
of a comestible or medicinal product. 

Also disclosed are methods of enhancing umami taste perception comprising 
contacting an umami receptor with cyclamate and NHDC, and their derivatives, as well 
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as methods of enhancing umami taste perception comprising contacting an umami 
receptor with lactisole derivatives. Also disclosed are methods of enhancing sweet taste 
perception comprising contacting an sweet receptor with cyclamate and NHDC, and 
their derivatives. Also disclosed are methods of enhancing sweet taste perception 
5 comprising contacting an sweet receptor with lactisole derivatives. 

Isolation and Expression ofTIR Polypeptides 

Isolation and expression of the TIRs, or fragments or variants thereof, of the 
invention can be performed as described below. PCR primers can be used for the 
amplification of nucleic acids encoding taste receptor ligand-binding regions, and 

1 0 libraries of these nucleic acids can optionally be generated. Individual expression 
vectors or libraries of expression vectors can then be used to infect or transfect host 
cells for the functional expression of these nucleic acids or libraries. These genes and 
vectors can be made and expressed in vitro or in vivo. One of skill will recognize that 
desired phenotypes for altering and controlling nucleic acid expression can be obtained 

1 5 by modulating the expression or activity of the genes and nucleic acids (e.g. , promoters, 
enhancers and the like) within the vectors of the invention. Any of the known methods 
described for increasing or decreasing expression or activity can be used. The 
invention can be practiced in conjunction with any method or protocol known in the art, 
which are well described in the scientific and patent literature. 
20 The nucleic acid sequences of the invention and other nucleic acids used to 

practice this invention, whether RNA, cDNA, genomic DNA, vectors, viruses or 
hybrids thereof, may be isolated from a variety of sources, genetically engineered, 
amplified, and/or expressed recombinantly. Any recombinant expression system can be 
used, including, in addition to mammalian cells, e.g., bacterial, yeast, insect, or plant 
25 systems. 

Alternatively, these nucleic acids can be synthesized in vitro by well-known 
chemical synthesis techniques, as described in, e.g. , Carruthers, Cold Spring Harbor 
Symp. Quant. Biol. 47:411-418 (1982); Adams, Am. Chem. Soc. 105:661 (1983); 
Belousov, Nucleic Acids Res. 25:3440-3444 (1997); Frenkel, Free Radic. Biol. Med. 
30 19:373-380 (1995); Blommers, Biochemistry 33:7886-7896 (1994); Narang, Meth. 
Enzymol 68:90 (1979); Brown, Meth. Enzymol. 68:109 (1979); Beaucage, Tetra. Lett. 
22:1859 (1981); U.S. Patent No. 4,458,066. Double-stranded DNA fragments may 
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then be obtained either by synthesizing the complementary strand and annealing the 
strands together under appropriate conditions, or by adding the complementary strand 
using DNA polymerase with an appropriate primer sequence. 

Techniques for the manipulation of nucleic acids, such as, for example, for 
5 generating mutations in sequences, subcloning, labeling probes, sequencing, 

hybridization and the like are well described in the scientific and patent literature. See, 
e.g., Sambrook, ed., Molecular Cloning: a Laboratory manual (2nd ed.), Vols. 1-3, Cold 
Spring Harbor Laboratory (1 989); Current Protocols in Molecular Biology, Ausubel, 
ed. John Wiley & Sons, Inc., New York (1997); Laboratory Techniques in 

1 0 Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part I, 
Theory and Nucleic Acid Preparation, Tijssen, ed. Elsevier, N.Y. (1993). 

Nucleic acids, vectors, capsids, polypeptides, and the like can be analyzed and 
quantified by any of a number of general means well known to those of skill in the art. 
These include, e.g., analytical biochemical methods such as NMR, spectrophotometry, 

15 radiography, electrophoresis, capillary electrophoresis, high performance liquid 
chromatography (HPLC), thin layer chromatography (TLC), and hyperdifiusion 
chromatography, various immunological methods, e.g, fluid or gel precipitin reactions, 
immunodiffusion, Immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), immuno-fluorescent assays, Southern analysis, 

20 Northern analysis, dot-blot analysis, gel electrophoresis (e.g. , SDS-PAGE), RT-PCR, 
quantitative PCR, other nucleic acid or target or signal amplification methods, 
radiolabeling, scintillation counting, and affinity chromatography. 

Oligonucleotide primers may be used to amplify nucleic acid fragments 
encoding taste receptor ligand-binding regions. The nucleic acids described herein can 

25 also be cloned or measured quantitatively using amplification techniques. 

Amplification methods are also well known in the art, and include, e.g., polymerase 
chain reaction, PCR (PCR Protocols, a Guide to Methods and Applications, ed. Innis. 
Academic Press, N.Y. (1990) and PCR Strategies, ed. Innis, Academic Press, Inc., N.Y. 
(1995), ligase chain reaction (LCR) (see, e.g, Wu, Genomics 4:560 (1989); Landegren, 

30 Science 241 :1077, (1988); Barringer, Gene 89:1 17 (1990)); transcription amplification 
(see, e.g, Kwoh, Proc. Natl Acad. Set USA 86:1173 (1989)); and, self-sustained 
sequence replication (see, e.g, Guatelli, Proc. Natl Acad. Sci. USA 87:1874 (1990)); Q 
Beta replicase amplification (see, e.g., Smith, J. Clin. Microbiol 35:1477-1491 (1997)); 
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automated Q-beta replicase amplification assay (see, e.g, Burg, Mol Cell Probes 
10:257-271 (1996)) and other RNA polymerase mediated techniques (e.g., NASBA, 
Cangene, Mississauga, Ontario); see also Berger, Methods Enzymol 152:307-316 
(1987); Sambrook; Ausubel; U.S. Patent Nos. 4,683,195 and 4,683,202; Sooknanan, 

5 Biotechnology 13:563-564 (1995). The primers can be designed to retain the original 
sequence of the "donor" 7-membrane receptor. Alternatively, the primers can encode 
amino acid residues that are conservative substitutions (e.g. 9 hydrophobic for 
hydrophobic residue, see above discussion) or functionally benign substitutions (e.g., 
do not prevent plasma membrane insertion, cause cleavage by peptidase, cause 

0 abnormal folding of receptor, and the like). Once amplified, the nucleic acids, either 
individually or as libraries, may be cloned according to methods known in the art, if 
desired, into any of a variety of vectors using routine molecular biological methods; 
methods for cloning in vitro amplified nucleic acids are described, e.g., U.S. Pat. No. 
5,426,039. 

5 The primer pairs may be designed to selectively amplify ligand-binding regions 

oftheTIR family members. These regions may vary for different ligands or tastants. 
Thus, what may be a minimal binding region for one tastant, may be too limiting for a 
second tastant. Accordingly, ligand-binding regions of different sizes comprising 
different extracellular domain structures may be amplified 

0 Paradigms to design degenerate primer pairs are well known in the art. For 

example, a COnsensus-DEgenerate Hybrid Oligonucleotide Primer (CODEHOP) 
strategy computer program is accessible as http://blocks.fhcrc.org/codehop.html, and is 
directly linked from the BlockMaker multiple sequence alignment site for hybrid 
primer prediction beginning with a set of related protein sequences, as known taste 

5 receptor ligand-binding regions (see, e.g. 9 Rose, Nucleic Acids Res. 26:1628-1635 
(1998); Singh, Biotechniques 24:318-319 (1998)). 

Means to synthesize oligonucleotide primer pairs are well known in the art. 
''Natural" base pairs or synthetic base pairs can be used. For example, use of artificial 
nucleobases offers a versatile approach to manipulate primer sequence and generate a 

0 more complex mixture of amplification products. Various families of artificial 

nucleobases are capable of assuming multiple hydrogen bonding orientations through 
internal bond rotations to provide a means for degenerate molecular recognition. 
Incorporation of these analogs into a single position of a PCR primer allows for 
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generation of a complex library of amplification products. See, e.g., Hoops, Nucleic 
Acids Res. 25:4866-4871 (1997). Nonpolar molecules can also be used to mimic the 
shape of natural DNA bases. A non-hydrogen-bonding shape mimic for adenine can 
replicate efficiently and selectively against a nonpolar shape mimic for thymine (see, 

5 e.g., Morales, Nat Struct Biol 5:950-954 (1998)). For example, two degenerate bases 
can be the pyrimidine base 6H, 8H-3,4-dihydropyrimido[4,5-c][l,2]oxazin-7-one or the 
purine base N6-methoxy-2,6-diaminopurine (see, e.g., Hill, Proc. Natl. Acad Set USA 
95:4258-4263 (1998)). Exemplary degenerate primers of the invention incorporate the 
nucleobase analog 5'-Dimethoxytrityl-N-benzoyl-2 , -deoxy-Cytidine,3 , -[(2- 

1 0 cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite (the term "P" in the sequences, see 
above). This pyrimidine analog hydrogen bonds with purines, including A and G 
residues. 

Polymorphic variants, alleles, and interspecies homologs that are substantially 
identical to a taste receptor disclosed herein can be isolated using the nucleic acid 
15 probes described above. Alternatively, expression libraries can be used to clone T1R 
polypeptides and polymorphic variants, alleles, and interspecies homologs thereof, by 
detecting expressed homologs immunologically with antisera or purified antibodies 
made against a T1R polypeptide, which also recognize and selectively bind to the T1R 
homolog. 

20 Nucleic acids that encode ligand-binding regions of taste receptors may be 

generated by amplification (e.g., PCR) of appropriate nucleic acid sequences using 
degenerate primer pairs. The amplified nucleic acid can be genomic DNA from any 
cell or tissue or mRNA or cDNA derived from taste receptor-expressing cells. 

In one embodiment, hybrid protein-coding sequences comprising nucleic acids 

25 encoding TIRs fused to translocation sequences may be constructed. Also provided are 
hybrid TIRs comprising the translocation motifs and tastant-binding domains of other 
families of chemosensory receptors, particularly taste receptors. These nucleic acid 
sequences can be operably linked to transcriptional or translational control elements, 
e.g., transcription and translation initiation sequences, promoters and enhancers, 

30 transcription and translation terminators, polyadenylation sequences, and other 
sequences useful for transcribing DNA into RNA. In constitutive of recombinant 
expression cassettes, vectors, and transgenics, a promoter fragment can be employed to 
direct expression of the desired nucleic acid in all desired cells or tissues. 
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In another embodiment, fusion proteins may include C-terminal or N-terminal 
translocation sequences. Further, fusion proteins can comprise additional elements, 
e.g., for protein detection, purification, or other applications. Detection and 
purification facilitating domains include, e.g., metal chelating peptides such as 
5 polyhistidine tracts, histidine-tryptophan modules, or other domains that allow 

purification on immobilized metals; maltose binding protein; protein A domains that 
allow purification on immobilized immunoglobulin; or the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp., Seattle, WA). 

The inclusion of a cleavable linker sequences such as Factor Xa (see, e.g., 
10 Ottavi, Biochimie 80:289-293 (1998)), subtilisin protease recognition motif (see, e.g., 
Polyak, Protein Eng. 10:615-619 (1997)); enterokinase (Invitrogen, San Diego, CA), 
and the like, between the translocation domain (for efficient plasma membrane 
expression) and the rest of the newly translated polypeptide may be useful to facilitate 
purification. For example, one construct can include a polypeptide encoding a nucleic 
15 acid sequence linked to six histidine residues followed by a thioredoxin, an 

enterokinase cleavage site (see, e.g., Williams, Biochemistry 34:1787-1797 (1995)), 
and an C-terminal translocation domain. The histidine residues facilitate detection and 
purification while the enterokinase cleavage site provides a means for purifying the 
desired protein(s) from the remainder of the fusion protein. Technology pertaining to 
20 vectors encoding fusion proteins and application of fusion proteins are well described 
in the scientific and patent literature, see, e.g., Kroll, DNA Cell. Biol. 12:441-53 (1993). 

Expression vectors, either as individual expression vectors or as libraries of 
expression vectors, comprising the ligand-binding domain encoding sequences may be 
introduced into a genome or into the cytoplasm or a nucleus of a cell and expressed by 
25 a variety of conventional techniques, well described in the scientific and patent 

literature. See, e.g., Roberts, Nature 328:731 (1987); Berger supra; Schneider, Proton 
Expr. Purif. 6435:10 (1995); Sambrook; Tijssen; Ausubel. Product information from 
manufacturers of biological reagents and experimental equipment also provide 
information regarding known biological methods. The vectors can be isolated from 
30 natural sources, obtained from such sources as ATCC or GenBank libraries, or prepared 
by synthetic or recombinant methods. 

The nucleic acids can be expressed using expression cassettes, vectors or 
viruses which are stably or transiently expressed in cells (e.g. , episomal expression 
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systems). Selection markers can be incorporated into expression cassettes and vectors 
to confer a selectable phenotype on transformed cells and sequences. For example, 
selection markers can code for episomai maintenance and replication such that 
integration into the host genome is not required. For example, the marker may encode 
5 antibiotic resistance {e.g, chloramphenicol, kanamycin, G418, blasticidin, hygromycin) 
or herbicide resistance (e.g., chlorosulfiiron or Basta) to permit selection of those cells 
transformed with the desired DNA sequences {see, e.g., Blondelet-Rouault, Gene 
190:315-317 (1997); Aubrecht, J. Pharmacol Exp. Ther. 281:992-997 (1997)). 
Because selectable marker genes conferring resistance to substrates like neomycin or 

10 hygromycin can only be utilized in tissue culture, chemoresistance genes are also used 
as selectable markers in vitro and in vivo. 

A chimeric nucleic acid sequence may encode a T 1R ligand-binding domain 
within any 7-transmembrane polypeptide. Because 7-transmembrane receptor 
polypeptides have similar primary sequences and secondary and tertiary structures, 

15 structural domains {e.g., extracellular domain, TM domains, cytoplasmic domain, etc.) 
can be readily identified by sequence analysis. For example, homology modeling, 
Fourier analysis and helical periodicity detection can identify and characterize the 
seven domains with a 7-transmembrane receptor sequence. Fast Fourier Transform 
(FFT) algorithms can be used to assess the dominant periods that characterize profiles 

20 of the hydrophobicity and variability of analyzed sequences. Periodicity detection 
enhancement and alpha helical periodicity index can be done as by, e.g., Donnelly, 
Protein Set 2:55-70 (1993). Other alignment and modeling algorithms are well known 
in the art, see, e.g., Peitsch, Receptors Channels 4:161-164 (1996); Kyte & Doolittle, J. 
Med. Bio., 157:105-132 (1982); Cronet, Protein Eng. 6:59-64 (1993). 

25 The present invention also includes not only the DNA and proteins having the 

specified nucleic and amino acid sequences, but also DNA fragments, particularly 
fragments of, e.g., 40, 60, 80, 100, 150, 200, or 250 nucleotides, or more, as well as 
protein fragments of, e.g., 10, 20, 30, 50, 70, 100, or 150 amino acids, or more. 
Optionally, the nucleic acid fragments can encode an antigenic polypeptide, which is 

30 capable of binding to an antibody raised against a T1R family member. Further, a 
protein fragment of the invention can optionally be an antigenic fragment, which is 
capable of binding to an antibody raised against a T1R family member. 



102 



WO 2005/015158 



PCT/US2004/025459 



Also contemplated are chimeric proteins, comprising at least 10, 20, 30, 50, 70, 
100, or 150 amino acids, or more, of one of at least one of the T1R polypeptides 
described herein, coupled to additional amino acids representing all or part of another 
GPCR, preferably a member of the 7 transmembrane superfamily. These chimeras can 
5 be made from the instant receptors and another GPCR, or they can be made by 

combining two or more of the present T1R receptors. In one embodiment, one portion 
of the chimera coiresponds to or is derived from the extracellular domain of a T1R 
polypeptide of the invention. In another embodiment, one portion of the chimera 
corresponds to, or is derived from the extracellular domain and one or more of the 

10 transmembrane domains of a T1R polypeptide described herein, and the remaining 
portion or portions can come from another GPCR. Chimeric receptors are well known 
in the art, and the techniques for creating them and the selection and boundaries of 
domains or fragments of G protein-coupled receptors for incorporation therein are also 
well known. Thus, this knowledge of those skilled in the art can readily be used to 

15 create such chimeric receptors. The use of such chimeric receptors can provide, for 
example, a taste selectivity characteristic of one of the receptors specifically disclosed 
herein, coupled with the signal transduction characteristics of another receptor, such as 
a well known receptor used in prior art assay systems. 

As noted above, such chimeras, analogous to the native T1R receptor, or native 

20 T1R receptor combination or association will bind to and/or be activated by molecules 
that normally affect sweet taste or umami taste. Functional chimeric T1R receptors or 
receptor combinations are molecules which when expressed alone or in combination 
with other TIRs or other GPCRs (which may themselves be chimeric) bind to or which 
are activated by taste stimuli, particularly sweet (T1R2/3) or umami taste stimuli 

25 (T1R1/3). Molecules that elicit sweet taste include natural and artificial sweeteners 
such as sucrose, aspartame, xylitol, cyclamate, et al., Molecules that elicit umami taste 
include glutamate and glutamate analogs and other compounds that bind to native T1R1 
and/or T1R3, such as 5* -nucleotides. 

For example, a domain such as a ligand-binding domain, an extracellular 

30 domain, a transmembrane domain, a transmembrane domain, a cytoplasmic domain, an 
N-terminal domain, a C-terminal domain, or any combination thereof, can be 
covalently linked to a heterologous protein. For instance, an T1R extracellular domain 
can be linked to a heterologous GPCR transmembrane domain, or a heterologous 
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GPCR extracellular domain can be linked to a T1R transmembrane domain. Other 
heterologous proteins of choice can be used; e.g., green fluorescent protein. 

Also within the scope of the invention are host cells for expressing the TIRs, 
fragments, chimeras or variants of the invention. To obtain high levels of expression of 
5 a cloned gene or nucleic acid, such as cDNAs encoding the TIRs, fragments, or 

variants of the invention, one of skill typically subclones the nucleic acid sequence of 
interest into an expression vector that contains a strong promoter to direct transcription, 
a transcription/translation terminator, and if for a nucleic acid encoding a protein, a 
ribosome binding site for translational initiation. Suitable bacterial promoters are well 

10 known in the art and described, e.g., in Sambrook et al. However, bacterial or 
eukaryotic expression systems can be used. 

Any of the well-known procedures for introducing foreign nucleotide sequences 
into host cells may be used. These include the use of calcium phosphate transfection, 
polybrene, protoplast fusion, electroporation, liposomes, microinjection, plasma 

1 5 vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell 
(see, e.g.. Sambrook et al) It is only necessary that the particular genetic engineering 
procedure used be capable of successfully introducing at lest one nucleic acid molecule 
into the host cell capable, of expressing the T1R, fragment, or variant of interest. 

20 After the expression vector is introduced into the cells, the transfected cells are 

cultured under conditions favoring expression of the receptor, fragment, or variant of 
interest, which is then recovered from the culture using standard techniques. Examples 
' of such techniques are well known in the art. See, e.g., WO 00/06593, which is 
incorporated by reference in a manner consistent with this disclosure. 

25 

Detection of T1R polypeptides 

In addition to the detection of T1R genes and gene expression using nucleic 
acid hybridization technology, one can also use immunoassays to detect TIRs, e.g., to 
identify taste receptor cells, and variants of T1R family members. Immunoassays can 
30 be used to qualitatively or quantitatively analyze the TIRs. A general overview of the 
applicable technology can be found in Harlow & Lane, Antibodies: A Laboratory 
Manual (1988). 
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1. Antibodies to T1R family members 

Methods of producing polyclonal and monoclonal antibodies that react 
specifically with a T1R family member are known to those of skill in the art {see, e.g., 
Coligan, Current Protocols in Immunology (1991); Harlow & Lane, supra; Goding, 
5 Monoclonal Antibodies: Principles and Practice (2d ed. 1 986); and Kohler & Milstein, 
Nature, 256:495-497 (1 975)). Such techniques include antibody preparation by 
selection of antibodies from libraries of recombinant antibodies in phage or similar 
vectors, as well as preparation of polyclonal and monoclonal antibodies by immunizing 
rabbits or mice {see t e.g., Huse et al, Science, 246:1275-1281 (1989); Ward et al, 
10 Afa/wr041:544-546(1989)). 

A number of TIR-comprising immunogens may be used to produce antibodies 
specifically reactive with a T1R family member. For example, a recombinant T1R 
polypeptide, or an antigenic fragment thereof, can be isolated as described herein. 
Suitable antigenic regions include, e.g., the consensus sequences that are used to 
15 identify members of the T1R family. Recombinant proteins can be expressed in 

eukaryotic or prokaryotic cells as described above, and purified as generally described 
above. Recombinant protein is the preferred immunogen for the production of 
monoclonal or polyclonal antibodies. Alternatively, a synthetic peptide derived from 
the sequences disclosed herein and conjugated to a carrier protein can be used an 
20 immunogen. Naturally occurring protein may also be used either in pure or impure 
form. The product is then injected into an animal capable of producing antibodies. 
Either monoclonal or polyclonal antibodies may be generated, for subsequent use in 
immunoassays to measure the protein. 

Methods of production of polyclonal antibodies are known to those of skill in 
25 the art. For example, an inbred strain of mice {e.g., BALB/C mice) or rabbits is 

immunized with the protein using a standard adjuvant, such as Freund's adjuvant, and a 
standard immunization protocol. The animal's immune response to the immunogen 
preparation is monitored by taking test bleeds and determining the titer of reactivity to 
the T1R. When appropriately high titers of antibody to the immunogen are obtained, 
30 blood is collected from the animal and antisera are prepared. Further fractionation of 
the antisera to enrich for antibodies reactive to the protein can be done if desired {see 
Harlow & Lane, supra). 
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Monoclonal antibodies may be obtained by various techniques familiar to those 
skilled in the art. Briefly, spleen cells from an animal immunized with a desired 
antigen may be immortalized, commonly by fusion with a myeloma cell (see Kohler & 
Milstein, Eur. J. Immunol, 6:51 1-519 (1976)). Alternative methods of immortalization 
5 include transforation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are 
screened for production of antibodies of the desired specificity and affinity for the 
antigen, and yield of the monoclonal antibodies produced by such cells may be 
enhanced by various techniques, including injection into the peritoneal cavity of a 
10 vertebrate host. Alternatively, one may isolate DNA sequences which encode a 

monoclonal antibody or a binding fragment thereof by screening a DNA library from 
human B cells according to the general protocol outlined by Huse et al, Science, 
246:1275-1281 (1989). 

Monoclonal antibodies and polyclonal sera are collected and titered against the 
15 immunogen protein in an immunoassay, for example, a solid phase immunoassay with - 
the immunogen immobilized on a solid support. Typically, polyclonal antisera with a 
titer of 104 or greater are selected and tested for their cross reactivity against non-TIR 
polypeptides, or even other T1R family members or other related proteins from other 
organisms, using a competitive binding immunoassay. Specific polyclonal antisera and 
20 monoclonal antibodies will usually bind with a Kd of at least about 0. 1 mM, more 
usually at least about 1 pM, optionally at least about 0.1 pM or better, and optionally 
0.01 pM or better. 

Once T1R family member specific antibodies are available, individual T1R 
proteins and protein fragments can be detected by a variety of immunoassay methods. 
25 For a review of immunological and immunoassay procedures, see Basic and Clinical 
Immunology (Stites & Terr eds., 7th ed. 1991). Moreover, the immunoassays of the 
present invention can be performed in any of several configurations, which are 
reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980); and Harlow & 
Lane, supra. 

30 

2. Immunological binding assays 

T1R proteins, fragments, and variants can be detected and/or quantified using 
any of a number of well-recognized immunological binding assays (see, e.g., U.S. 
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Patents 4,366,241; 4,376,1 10; 4,517,288; and 4,837,168). For a review of the general 
immunoassays, see also Methods in Cell Biology: Antibodies in Cell Biology, volume 
37 (Asai, ed. 1993); Basic and Clinical Immunology (Stites & Terr, eds., 7th ed. 1991). 
Immunological binding assays (or immunoassays) typically use an antibody that 
5 specifically binds to a protein or antigen of choice (in this case a T1R family member 
or an antigenic subsequence thereof). The antibody (e.g, anti-TIR) may be produced 
by any of a number of means well known to those of skill in the art and as described 
above. 

Immunoassays also often use a labeling agent to specifically bind to and label 

10 the complex formed by the antibody and antigen. The labeling agent may itself be one 
of the moieties comprising the antibody/antigen complex. Thus, the labeling agent may 
be a labeled T1R polypeptide or a labeled anti-TIR antibody. Alternatively, the 
labeling agent may be a third moiety, such a secondary antibody that specifically binds 
to the antibody/TIR complex (a secondary antibody is typically specific to antibodies 

15 of the species from which the first antibody is derived). Other proteins capable of 
specifically binding immunoglobulin constant regions, such as protein A or protein G 
may also be used as the label agent. These proteins exhibit a strong non-immunogenic 
reactivity with immunoglobulin constant regions from a variety of species (see, e.g., 
Kronval et aL, J. Immunol, 1 1 1:1401-1406 (1973); Akerstrom et aL, J. Immunol., 

20 135:2589-2542 (1985)). The labeling agent can be modified with a detectable moiety, 
such as biotin, to which another molecule can specifically bind, such as streptavidin. A 
variety of detectable moieties are well known to those skilled in the art. 

Throughout the assays, incubation and/or washing steps maybe required after 
each combination of reagents. Incubation steps can vary from about 5 seconds to 

25 several hours, optionally from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, antigen, volume of solution, 
concentrations, and the like. Usually, the assays will be carried out at ambient 
temperature, although they can be conducted over a range of temperatures, such as 
10°Cto40°C 

30 

A. Non-competitive assay formats 
Immunoassays for detecting a T1R polypeptide in a sample may be either 
competitive or noncompetitive. Noncompetitive immunoassays are assays in which the 
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amount of antigen is directly measured. In one preferred "sandwich" assay, for 
example, the anti-TIR antibodies can be bound directly to a solid substrate on which 
they are immobilized. These immobilized antibodies then capture the T1R polypeptide 
present in the test sample. The T1R polypeptide is thus immobilized is then bound by a 
5 labeling agent, such as a second T1R antibody bearing a label. Alternatively, the 
second antibody may lack a label, but it may, in turn, be bound by a labeled third 
antibody specific to antibodies of the species from which the second antibody is 
derived. The second or third antibody is typically modified with a detectable moiety, 
such as biotin, to which another molecule specifically binds, e.g., streptavidin, to 
10 provide a detectable moiety. 

B. Competitive assay formats 
In competitive assays, the amount of T1R polypeptide present in the sample is 
measured indirectly by measuring the amount of a known, added (exogenous) T1R 

1 5 polypeptide displaced (competed away) from an anti-T 1R antibody by the unknown 
T1R polypeptide present in a sample. In one competitive assay, a known amount of 
T1R polypeptide is added to a sample and the sample is then contacted with an 
antibody that specifically binds to the T1R. The amount of exogenous T1R polypeptide 
bound to the antibody is inversely proportional to the concentration of T1R polypeptide 

20 present in the sample. In a particularly. preferred embodiment, the antibody is 
immobilized on a solid substrate. The amount of T1R polypeptide bound to the 
antibody may be determined either by measuring the amount of T1R polypeptide 
present in a TIR/antibody complex, or alternatively by measuring the amount of 
remaining uncomplexed protein. The amount of T1R polypeptide may be detected by 

25 providing a labeled T1R molecule. 

A hapten inhibition assay is another preferred competitive assay. In this assay 
the known T1R polypeptide is immobilized on a solid substrate. A known amount of 
anti-TIR antibody is added to the sample, and the sample is then contacted with the 
immobilized T1R. The amount of anti-TIR antibody bound to the known immobilized 

30 T1R polypeptide is inversely proportional to the amount of T1R polypeptide present in 
the sample. Again, the amount of immobilized antibody may be detected by detecting 
either the immobilized fraction of antibody or the fraction of the antibody that remains 
in solution. Detection may be direct where the antibody is labeled or indirect by the 
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subsequent addition of a labeled moiety that specifically binds to the antibody as 
described above. 

C. Cross-reactivity determinations 
5 Immunoassays in the competitive binding format can also be used for cross- 

reactivity determinations. For example, a protein at least partially encoded by the 
nucleic acid sequences disclosed herein can be immobilized to a solid support. Proteins 
(e.g., T1R polypeptides and homologs) are added to the assay that compete for binding 
of the antisera to the immobilized antigen. The ability of the added proteins to compete 

1 0 for binding of the antisera to the immobilized protein is compared to the ability of the 
T1R polypeptide encoded by the nucleic acid sequences disclosed herein to compete 
with itself. The percent cross-reactivity for the above proteins is calculated, using 
standard calculations. Those antisera with less than 10% cross-reactivity with each of 
the added proteins listed above are selected and pooled. The cross-reacting antibodies 

1 5 are optionally removed from the pooled antisera by immunoabsorption with the added 
considered proteins, e.g., distantly related homologs. In addition, peptides comprising 
amino acid sequences representing conserved motifs that are used to identify members 
of the T1R family can be used in cross-reactivity determinations. 

The immunoabsorbed and pooled antisera are then used in a competitive 

20 binding immunoassay as described above to compare a second protein, thought to be 
perhaps an allele or polymorphic variant of a T1R family member, to the immunogen 
protein (i.e., T1R polypeptide encoded by the nucleic acid sequences disclosed herein). 
In order to make this comparison, the two proteins are each assayed at a wide range of ' 
concentrations and the amount of each protein required to inhibit 50% of the binding of 

25 the antisera to the immobilized protein is determined. If the amount of the second 
protein required to inhibit 50% of binding is less than 10 times the amount of the 
protein encoded by nucleic acid sequences disclosed herein required to inhibit 50% of 
binding, then the second protein is said to specifically bind to the polyclonal antibodies 
generated to a T1R immunogen. 

30 Antibodies raised against T1R conserved motifs can also be used to prepare 

antibodies that specifically bind only to GPCRs of the T1R family, but not to GPCRs 
from other families. 
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Polyclonal antibodies that specifically bind to a particular member of the T1R 
family can be made by subtracting out cross-reactive antibodies using other T1R family 
members. Species-specific polyclonal antibodies can be made in a similar way. For 
example, antibodies specific to human T1R1 can be made by, subtracting out antibodies 
5 that are cross-reactive with orthologous sequences, e.g. 9 rat T1R1 or mouse T1R1. 

D. Other assay formats 
Western blot (immunoblot) analysis is used to detect and quantify the presence 
of T1R polypeptide in the sample. The technique generally comprises separating 

1 0 sample proteins by gel electrophoresis on the basis of molecular weight, transferring 
the separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon 
filter, or derivatized nylon filter), and incubating the sample with the antibodies that 
specifically bind the T1R polypeptide. The anti-TIR polypeptide antibodies 
specifically bind to the T1R polypeptide on the solid support, These antibodies may be v 

1 5 directly labeled or alternatively may be subsequently detected using labeled antibodies • 
(eg., labeled sheep anti-mouse antibodies) that specifically bind to the anti-TIR 
antibodies. 

Other, assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release 
20 encapsulated reagents or markers. The released chemicals are then detected according - * 
to standard techniques (see Monroe et al y Amer. Clin. Prod. Rev., 5:34-41 (1986)). 

E. Reduction of non-spe cific binding 

One of skill in the art will appreciate that it is often desirable to minimize non- 
25 specific binding in immunoassays. Particularly, where the assay involves an antigen or 
antibody immobilized on a solid substrate it is desirable to minimize the amount of 
non-specific binding to the substrate. Means of reducing such non-specific binding are 
well known to those of skill in the art. Typically, this technique involves coating the 
substrate with a proteinaceous composition. In particular, protein compositions such as 
30 bovine serum albumin (BSA), nonfat powdered milk, and gelatin are widely used with 
powdered milk being most preferred. 

F. Labels 
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The particular label or detectable group used in the assay is not a critical aspect 
of the invention, as long as it does not significantly interfere with the specific binding 
of the antibody used in the assay. The detectable group can be any material having a 
detectable physical or chemical property. Such detectable labels have been well 
5 developed in the field of immunoassays and, in general, most any label useful in such 
methods can be applied to the present invention. Thus, a label is any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 
optical, or chemical means. Useful labels in the present invention include magnetic 
beads (e.g., DYNABEADSTM), fluorescent dyes (e.g., fluorescein isothiocyanate, 

10 Texas red, rhodamine, and the like), radiolabels (e.g., 3 H, 125 I, l4 C, 35 S), enzymes (e.g, 
horseradish peroxidase, alkaline phosphates and others commonly used in an ELISA), 
and colorimetric labels such as colloidal gold or colored glass or plastic beads (e.g., 
polystyrene, polypropylene, latex, etc.). 

The label may be coupled directly or indirectly to the desired component of the 

15 assay according to methods well known in the art. As indicated above, a wide variety • 
of labels may be used, with the choice of label depending on sensitivity required, ease 
of conjugation with the compound, stability requirements, available instrumentation, 
and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, a ligand 

20 molecule (e.g. , biotin) is coyalently bound to the molecule. The ligand then binds to 
another molecules (e.g., streptavidin) molecule, which is either inherently detectable or 
covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
compound, or a chemiluminescent compound. The ligands and their targets can be 
used in any suitable combination with antibodies that recognize a T1R polypeptide, or 

25 secondary antibodies that recognize anti-TlR. 

The molecules can also be conjugated directly to signal generating compounds, 
e-g- > by conjugation with an enzyme or fluorophore. Enzymes of interest as labels will 
primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or 
oxidotases, particularly peroxidases. Fluorescent compounds include fluorescein and 

30 its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. 

Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, 
e.g., luminol. For a review of various labeling or signal producing systems that may be 
used, see U.S. Patent No. 4,391,904. 
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Means of detecting labels are well known to those of skill in the art. Thus, for 
example, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic film as in autoradiography. Where the label is a 
fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
5 wavelength of light and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic film, by the use of electronic detectors 
such as charge-coupled devices (CCDs) or photomultipliers and the like. Similarly, 
enzymatic labels may be detected by providing the appropriate substrates for the 
enzyme and detecting the resulting reaction product. Finally simple colorimetric labels 

10 may be detected simply by observing the color associated with the label. Thus, in 
various dipstick assays, conjugated gold often appears pink, while various conjugated 
beads appear the color of the bead. 

Some assay formats do not require the use of labeled components. For instance, 
agglutination assays can be used to detect the presence of the target antibodies. In this 

1 5 case, antigen-coated particles are agglutinated by samples comprising the target 

antibodies. In this format, none of the components need be labeled and the presence of 
the target antibody is detected by simple visual inspection. 

Detection of Modulators 

20 Compositions and methods for determining whether a test compound 

specifically binds to a T1R receptor of the invention, both in vitro and in vivo, are 
described below. Many aspects of cell physiology can be monitored to assess the effect 
of ligand binding to a T1R polypeptide of the invention. These assays may be 
performed on intact cells expressing a chemosensory receptor, on permeabilized cells, 

25 or on membrane fractions produced by standard methods or in vitro de novo 
synthesized proteins. 

In vivo, taste receptors bind tastants and initiate the transduction of chemical 
stimuli into electrical signals. An activated or inhibited G protein will in turn alter the 
properties of target enzymes, channels, and other effector proteins. Some examples are 

30 the activation of cGMP phosphodiesterase by transducin in the visual system, adenylate 
cyclase by the stimulatory G protein, phospholipase C by Gq and other cognate G 
proteins, and modulation of diverse channels by Gi and other G proteins. Downstream 
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consequences can also be examined such as generation of diacyl glycerol and IP 3 by 
phospholipase C, and in turn, for calcium mobilization by IP3. 

The T1R proteins or polypeptides of the assay will preferably be selected from a 
polypeptide having the T1R polypeptide sequence selected from those disclosed in 
5 Example 1, or fragments or conservatively modified variants thereof. Optionally, the 
fragments and variants can be antigenic fragments and variants which bind to an anti- 
T1R antibody. Optionally, the fragments and variants can bind to or are activated by 
sweeteners or umami tastants. 

Alternatively, the T1R proteins or polypeptides of the assay can be derived from 

10 a eukaryotic host cell and can include an amino acid subsequence having amino acid 
sequence identity to the T1R polypeptides disclosed in Example 1, or fragments, or 
conservatively modified variants thereof. Generally, the amino acid sequence identity 
will be at least 35 to 50%, or optionally 75%, 85%, 90%, 95%, 96%, 97%, 98%, or 
99%. Optionally, the T1R proteins or polypeptides of the assays can comprise a 

15 domain of a T1R protein, such as an extracellular domain, transmembrane region, 
transmembrane domain, cytoplasmic domain, ligand-binding domain, and the like. 
Further, as described above, the T1R protein or a domain thereof can be covalently 
linked to a heterologous protein to create a chimeric protein used in the assays 
described herein. 

20 Modulators of T1R receptor activity are tested using T1R proteins or 

polypeptides as described above, either recombinant or naturally occurring. The T1R 
proteins or polypeptides can be isolated, co-expressed in a cell, co-expressed in a 
membrane derived from a cell, co-expressed in tissue or in an animal, either 
recombinant or naturally occurring. For example, tongue slices, dissociated cells from 

25 a tongue, transformed cells, or membranes can be used. Modulation can be tested using 
one of the in vitro or in vivo assays described herein. 

For example, as disclosed in the experiment examples infra, it has been 
discovered that certain 5' nucleotides, e.g., 5' IMP or 5' GMP, enhance the activity of 
L-glutamate to activate the umami taste receptor, or block the activation of the umami 

30 taste receptor by umami taste stimuli such as L-glutamate and L-aspartate. 

1. In vitro binding assays 
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Taste transduction can also be examined in vitro with soluble or solid state 
reactions, using the T1R polypeptides of the invention. In a particular embodiment, 
T1R ligand-binding domains can be used in vitro in soluble or solid state reactions to 
assay for ligand binding. 
5 For instance, the T1R N-tenninal domain is predicted to be involved in ligand 

binding. More particularly, the TIRs belong to a GPCR sub-family that is 
characterized by large, approximately 600 amino acid, extracellular N-terminal 
segments. These N-terminal segments are thought to form the ligand-binding domains, 
and are therefore useful in biochemical assays to identify T1R agonists and antagonists. 
10 It is possible that the ligand-binding domain may be formed by additional portions of 
the extracellular domain, such as the extracellular loops of the transmembrane domain. 

In vitro binding assays have been used with other GPCRs that are related to the 
TIRs, such as the metabotropic glutamate receptors (see, e.g, Han and Hampson, J. 
Biol Chem. 274:10008-10013 (1999)). These assays might involve displacing a 
1 5 radioactively or fluorescently labeled ligand, measuring changes in intrinsic 
fluorescence or changes in proteolytic susceptibility, etc. 

Ligand binding to a hetero-multimeric complex of T1R polypeptides of the 
invention can be tested in solution, in a bilayer membrane, optionally attached to a solid , 
phase, in a lipid monolayer, or in vesicles. Binding of a modulator can be tested using, 
20 e.g. , changes in spectroscopic characteristics (e.g. , fluorescence, absorbence, refractive * 
index) hydrodynamic (e.g., shape), chromatographic, or solubility properties. 
[0207] In another embodiment of the invention, a GTPy 35 S assay may be used. As 
described above, upon activation of a GPCR, the Got subunit of the G protein complex 
is stimulated to exchange bound GDP for GTP. Ligand-mediated stimulation of G 
25 protein exchange activity can be measured in a biochemical assay measuring the 
binding of added radioactively labeled GTPy 35 S to the G protein in the presence of a 
putative ligand. Typically, membranes containing the chemosensory receptor of 
interest are mixed with a complex of G proteins. Potential inhibitors and/or activators 
and GTPy 35 S are added to the assay, and binding of GTPy 35 S to the G protein is 
30 measured. Binding can be measured by liquid scintillation counting or by any other 
means known in the art, including scintillation proximity assays (SPA). In other assays 
formats, fluorescently labeled GTPyS can be utilized. 
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2. Fluorescence Polarization Assays 
In another embodiment, Fluorescence Polarization ( 4 TP") based assays may be 
used to detect and monitor ligand binding. Fluorescence polarization is a versatile 
laboratory technique for measuring equilibrium binding, nucleic acid hybridization, and 
5 enzymatic activity. Fluorescence polarization assays are homogeneous in that they do 
not require a separation step such as centrifugation, filtration, chromatography, 
precipitation, or electrophoresis. These assays are done in real time, directly in solution 
and do not require an immobilized phase. Polarization values can be measured 
repeatedly and after the addition of reagents since measuring the polarization is rapid 
10 and does not destroy the sample. Generally, this technique can be used to measure 
polarization values of fluorophores from low picomolar to micromolar levels. This 
section describes how fluorescence polarization can be used in a simple and 
quantitative way to measure the binding of ligands to the T1R polypeptides of the 
invention. 

1 5 When a fluorescently labeled molecule is excited with plane-polarized light, it ; 

emits light that has a degree of polarization that is inversely proportional to its 
molecular rotation. Large fluorescently labeled molecules remain relatively stationary 
during the excited state (4 nanoseconds in the case of fluorescein) and the polarization 
of the light remains relatively constant between excitation and emission. Small 

20 fluorescently labeled molecules rotate rapidly during the excited state and the 

polarization changes significantly between excitation and emission. Therefore, small 
molecules have low polarization values and large molecules have high polarization 
values. For example, a single-stranded fluorescein-labeled oligonucleotide has a 
relatively low polarization value but when it is hybridized to a complementary strand, it 

25 has a higher polarization value. When using FP to detect and monitor tastant-binding 

which may activate or inhibit the chemosensory receptors of the invention, 

fluorescence-labeled tastants or auto-fluorescent tastants may be used. 

Fluorescence polarization (P) is defined as: 

p _ Int u -Int ± 
Int u +Int ± 

30 Where II is the intensity of the emission light parallel to the excitation light plane and 
Int ± is the intensity of the emission light perpendicular to the excitation light plane. P, 
being a ratio of light intensities, is a dimensionless number. For example, the Beacon 
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® and Beacon 2000 ™ System may be used in connection with these assays. Such 
systems typically express polarization in millipolarization units (1 Polarization Unit 
=1000 mP Units). 

The relationship between molecular rotation and size is described by the Perrin 
5 equation and the reader is referred to Jolley, M. E. (1991) in Journal of Analytical 
Toxicology, pp. 236-240, which gives a thorough explanation of this equation. 
Summarily, the Perrin equation states that polarization is directly proportional to the 
rotational relaxation time, the time that it takes a molecule to rotate through an angle of 
approximately 68.5° Rotational relaxation time is related to viscosity (t)), absolute 
1 0 temperature (T), molecular volume (V), and the gas constant (R) by the following 
equation: 

3nV 

RotationalRelaxationTime = — 

RT 

The rotational relaxation time is small (« 1 nanosecond) for small molecules 
(e.g. fluorescein) and large (» 100 nanoseconds) for large molecules (e.g. 

15 immunoglobulins). If viscosity and temperature are held constant, rotational relaxation 
time, and therefore polarization, is directly related to the molecular volume. Changes 
in molecular volume may be due to interactions with other molecules, dissociation, 
polymerization, degradation, hybridization, or conformational changes of the 
fluorescently labeled molecule. For example, fluorescence polarization has been used 

20 to measure enzymatic cleavage of large fluorescein labeled polymers by proteases, 
DNases, and RNases. It also has been used to measure equilibrium binding for 
protein/protein interactions, antibody/antigen binding, and protein/DNA binding. 

A. Solid state and soluble high throughput assays 
25 In yet another embodiment, the invention provides soluble assays using a 

hetero-oligomeric T1R polypeptide complex; or a cell or tissue co-expressing T1R 
polypeptides. Preferably, the cell will comprise a cell line that stably co-expresses a 
functional T1R1/T1R3 (umami) taste receptor or T1R2/T1R3 (sweet) taste receptor. la 
another embodiment, the invention provides solid phase based in vitro assays in a high 
30 throughput format, where the T1R polypeptides, or cell or tissue expressing the T1R 
polypeptides is attached to a solid phase substrate or a taste stimulating compound and 
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contacted with a T1R receptor, and binding detected using an appropriate tag or 
antibody raised against the T1R receptor. 

In the high throughput assays of the invention, it is possible to screen up to 
several thousand different modulators or ligands in a single day. In particular, each 
5 well of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or, if concentration or incubation time effects are to be observed, every 5-10 
wells can test a single modulator. Thus, a single standard microtiter plate can assay 
about 100 (e.g., 96) modulators. If 1536 well plates are used, then a single plate can 
easily assay from about 1000 to about 1500 different compounds. It is also possible to 

1 0 assay multiple compounds in each plate well. It is possible to assay several different 
plates per day; assay screens for up to about 6,000-20,000 different compounds is 
possible using the integrated systems of the invention. More recently, microfluidic 
approaches to reagent manipulation have been developed. 

The molecule of interest can be bound to the solid state component, directly or 

15 indirectly, via covalent or hon-covalent linkage, eg., via a tag. The tag can be any of a 
variety of components. In general, a molecule which binds the tag (a tag binder) is 
fixed to a solid support, and the tagged molecule of interest (e.g., the taste transduction 
molecule of interest) is attached to the solid support by interaction of the tag and the tag 
binder. 

20 A number of tags and tag binders can be used, based upon known molecular 

interactions well described in the literature. For example, where a tag has a natural 
binder, for example, biotin, protein A, or protein G, it can be used in conjunction with 
appropriate tag binders (avidin, streptavidin, neutravidin, the Fc region of an 
immunoglobulin, etc). Antibodies to molecules with natural binders such as biotin are 

25 also widely available and appropriate tag binders (see, SIGMA Immunochemicals 1998 
catalogue SIGMA, St. Louis MO). 

Similarly, any haptenic or antigenic compound can be used in combination with 
an appropriate antibody to form a tag/tag binder pair. Thousands of specific antibodies 
are commercially available and many additional antibodies are described in the 

30 literature. For example, in one common configuration, the tag is a first antibody and 
the tag binder is a second antibody which recognizes the first antibody. In addition to 
antibody-antigen interactions, receptor-ligand interactions are also appropriate as tag 
and tag-binder pairs. For example, agonists and antagonists of cell membrane receptors 
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(e.g., cell receptor-ligand interactions such as transferrin, c-kit, viral receptor ligands, 
cytokine receptors, chemokine receptors, interleukin receptors, immunoglobulin 
receptors and antibodies, the cadherein family, the integrin family, the selectin family, 
and the like; see, e.g, Pigott & Power, The Adhesion Molecule Facts Book I (1993)). 
5 Similarly, toxins and venoms, viral epitopes, hormones (e.g. , opiates, steroids, etc.), 
intracellular receptors (e.g., which mediate the effects of various small ligands, 
including steroids, thyroid hormone, retinoids and vitamin D; peptides), drugs, lectins, 
sugars, nucleic acids (both linear and cyclic polymer configurations), oligosaccharides, 
proteins, phospholipids and antibodies can all interact with various cell receptors. 

10 Synthetic polymers, such as polyurethanes, polyesters, polycarbonates, 

polyureas, polyamides, polyethyleneimines, polyarylene sulfides, polysiloxanes, 
polyimides, and polyacetates can also form an appropriate tag or tag binder. Many 
other tag/tag binder pairs are also useful in assay systems described herein, as would be 
apparent to one of skill upon review of this disclosure. 

15 Common linkers such as peptides, polyethers, and the like can also serve as 

tags, and include polypeptide sequences, such as poly gly sequences of between about 5 
and 200 amino acids. Such flexible linkers are known to persons of skill in the art. For 
example, poly(ethelyne glycol) linkers are available from Shearwater Polymers, Inc; 
Huntsville, Alabama. These linkers optionally have amide linkages, sulfhydryl 

20 linkages, or heterofunctional linkages. 

Tag binders are fixed to solid substrates using any of a variety of methods 
currently available. Solid substrates are commonly derivatized or functionalized by 
exposing all or a portion of the substrate to a chemical reagent which fixes a chemical 
group to the surface which is reactive with a portion of the tag binder. For example, 

25 groups which are suitable for attachment to a longer chain portion would include 
amines, hydroxyl, thiol, and carboxyl groups. Aminoalkylsilanes and 
hydroxyalkylsilanes can be used to functionalize a variety of surfaces, such as glass 
surfaces. The constitutive of such solid phase biopolymer arrays is well described in 
the literature. See, e.g., Merrifield, J. Am. Chem. Soc, 85:2149-2154 (1963) 

30 (describing solid phase synthesis of, e.g., peptides); Geysen et al, J. Immun. Meth., 

102:259-274 (1987) (describing synthesis of solid phase components on pins); Frank & 
Doring, Tetrahedron, 44:60316040 (1988) (describing synthesis of various peptide 
sequences on cellulose disks); Fodor et al, Science, 251:767-777 (1991); Sheldon et 
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al, Clinical Chemistry, 39(4):718-719 (1993); and Kozal et al, Nature Medicine, 
2(7):753759 (1996) (all describing arrays of biopolymers fixed to solid substrates). 
Non-chemical approaches for fixing tag binders to substrates include other common 
methods, such as heat, cross-linking by UV radiation, and the like. 

5 

3. Cell-based assays 
In a preferred embodiment of treatment, a combination of T1R proteins or 
polypeptides are transiently or stably co-expressed in a eukaryotic cell either in 
unmodified forms or as chimeric, variant or truncated receptors with or preferably 

10 without a heterologous, chaperone sequence that facilitates its maturation and targeting 
through the secretory pathway. Such T1R polypeptides can be expressed in any 
eukaryotic cell, such as HEK-293 cells. Preferably, the cells comprise a functional G 
protein, e.g., Gocl5 or the chimeric G protein previously identified, or another G protein 
that is capable of coupling the chimeric receptor to an intracellular signaling pathway 

15 or to a signaling protein such as phospholipase C. Also, preferably a cell will be 

produced that stably co-expresses T1R1/T1R3 or T1R2/T1R3 as such cells have been 
found (as shown in the experimental examples) to exhibit enhanced responses to taste 
stimuli (relation to cells that transiently express the same T1R combination). 
Activation of T1R receptors in such cells can be detected using any standard method, 

20 such as by detecting changes in intracellular calcium by detecting Fluo-4 dependent 
fluorescence in the cell. Such an assay is the basis of the experimental findings 
presented in this application. 

Activated GPCR receptors often are substrates for kinases that phosphorylate 
the C-terminal tail of the receptor (and possibly other sites as well). Thus, activators 

25 will promote the transfer of 32 P from radiolabeled ATP to the receptor, which can be 
assayed with a scintillation counter. The phosphorylation of the C-terminal tail will 
promote the binding of arrestin-like proteins and will interfere with the binding of G 
proteins. For a general review of GPCR signal transduction and methods of assaying 
signal transduction, see, e.g., Methods in Enzymology, vols. 237 and 238 (1994) and 

30 volume 96 (1983); Bourne et al, Nature, 10:349:1 17-27 (1991); Bourne et al, Nature, 
348: 125-32 (1990); Pitcher et al, Annu. Rev. Biochem., 67:653-92 (1998). 

T1R modulation may be assayed by comparing the response of T1R 
polypeptides treated with a putative T1R modulator to the response of an untreated 
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control sample or a sample containing a known "positive" control. Such putative T1R 
modulators can include molecules that either inhibit or activate T1R polypeptide 
activity. In one embodiment, control samples (untreated with activators or inhibitors) 
are assigned a relative T1R activity value of 100. Inhibition of a T1R polypeptide is 
5 achieved when the T1R activity value relative to the control is about 90%, optionally 
50%, optionally 25-0%. Activation of a T1R polypeptide is achieved when the T1R 
activity value relative to the control is 110%, optionally 150%, 200-500%, or 1000- 
2000%. 

Changes in ion flux may be assessed by determining changes in ionic 
10 polarization (i.e., electrical potential) of the cell or membrane expressing a T1R 

polypeptide. One means to determine changes in cellular polarization is by measuring 
changes in current (thereby measuring changes in polarization) with voltage-clamp and 
patch-clamp techniques (see, e.g., the "cell-attached" mode, the "inside-out" mode, and 
the "whole cell" mode, e.g., Ackerman et al, New Engl J Med., 336:1575-1595 

15 (1 997)). Whole cell currents are conveniently determined using the standard. Other 
known assays include: radiolabeled ion flux assays and fluorescence assays using 
voltage-sensitive dyes (see, e.g., Vestergaird-Bogind et al, J. Membrane Biol, 
88:67-75 (1988); Gonzales & Tsien, Chem. Biol, 4:269277 (1997); Daniel et al, J. 
Pharmacol Meth., 25:185-193 (1991); Holevinsky etal,J. Membrane Biology, 

.20 137:59-70(1994)). 

The effects of the test compounds upon the function of the polypeptides can be 
measured by examining any of the parameters described above. Any suitable 
physiological change that affects GPCR activity can be used to assess the influence of a 
test compound on the polypeptides of this invention. When the functional 

25 consequences are determined using intact cells or animals, one can also measure a 
variety of effects such as transmitter release, hormone release, transcriptional changes 
to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
messengers such as Ca 2+ , IP3, cGMP, or cAMP. 

30 Preferred assays for GPCRs include cells that are loaded with ion or voltage 

sensitive dyes to report receptor activity. Assays for determining activity of such 
receptors can-also use known agonists and antagonists for other G protein-coupled 
receptors as controls to assess activity of tested compounds. In assays for identifying 
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modulatory compounds (e.g., agonists, antagonists), changes in the level of ions in the 
cytoplasm or membrane voltage will be monitored using an ion sensitive or membrane 
voltage fluorescent indicator, respectively. Among the ion-sensitive indicators and . 
voltage probes that may be employed are those disclosed in the Molecular Probes 1997 
5 Catalog. For G protein-coupled receptors, promiscuous G proteins such as Gal 5 and 
Gctl6 can be used in the assay of choice (Wilkie et ai 9 Proc. Nat'l Acad. ScL 9 
88:10049-10053 (1991)). 

Receptor activation initiates subsequent intracellular events, e.g., increases in 
second messengers. Activation of some G protein-coupled receptors stimulates the 

1 0 formation of inositol triphosphate (IP3) through phospholipase C-mediated hydrolysis 
of phosphatidylinositol (Berridge & Irvine, Nature, 312:315-21 (1984)). IP3 in turn 
stimulates the release of intracellular calcium ion stores. Thus, a change in cytoplasmic 
calcium ion levels, or a change in second messenger levels such as H?3 can be used to 
assess G protein-coupled receptor function. Cells expressing such G protein-coupled 

1 5 receptors may exhibit increased cytoplasmic calcium levels as a result of contribution 
from both calcium release from intracellular stores and extracellular calcium entry via 
plasma membrane ion channels. 

In a preferred embodiment, T1R polypeptide activity is measured by stably or 
transiently co-expressing T1R genes, preferably stably, in a heterologous cell with a 

20 promiscuous G protein that links the receptor to a phospholipase C signal transduction 
pathway {see Offermanns & Simon, I Biol Chem., 270:15175-15180 (1995)). In a 
preferred embodiment, the cell line is HEK-293 (which does not normally express T1R 
genes) and the promiscuous G protein is Gal 5 (Offermanns & Simon, supra). 
Modulation of taste transduction is assayed by measuring changes in intracellular Ca 2+ 

25 levels, which change in response to modulation of the T1R signal transduction pathway 
via administration of a molecule that associates with T1R polypeptides. Changes in 
Ca 2+ levels are optionally measured using fluorescent Ca 2+ indicator dyes and 
fluorometric imaging. 

In another embodiment, phosphatidyl inositol (PI) hydrolysis can be analyzed 

30 according to U.S. Patent 5,436,128, herein incorporated by reference. Briefly, the 
assay involves labeling of cells with 3H-myoinositol for 48 or more hrs. The labeled 
cells are treated with a test compound for one hour. The treated cells are lysed and 
extracted in chloroform-methanol-water after which the inositol phosphates were 
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separated by ion exchange chromatography and quantified by scintillation counting. 
Fold stimulation is determined by calculating the ratio of cpm in the presence of 
agonist, to cpm in the presence of buffer control. Likewise, fold inhibition is 
determined by calculating the ratio of cpm in the presence of antagonist, to cpm in the 
5 presence of buffer control (which may or may not contain an agonist). 

Other receptor assays can involve determining the level of intracellular cyclic 
nucleotides, e.g., cAMP or cGMP. In cases where activation of the receptor results in a 
decrease in cyclic nucleotide levels, it may be preferable to expose the cells to agents 
. that increase intracellular cyclic nucleotide levels, e.g., forskolin, prior to adding a 

10 receptor-activating compound to the cells in the assay. In one embodiment, the changes 
in intracellular cAMP or cGMP can be measured using immunoassays. The method 
described in Offermanns & Simon, J. Bio. Chem. 9 270:15175-15180 (1995), may be 
used to determine the level of cAMP. Also, the method described in Felley-Bosco et 
al.,Am. J. Resp. Cell andMol. Biol, 11:159-164 (1994), may be used to determine the 

15 level of cGMP. Further, an assay kit for measuring cAMP and/or cGMP is described in 
U.S. Patent 4,1 1 5,538, herein incorporated by reference. 

In another embodiment, transcription levels can be measured to assess the 
effects of a test compound on signal transduction. A host cell containing T1R 
polypeptides of interest is contacted with a test compound for a sufficient time to effect 

20 any interactions, and then the level of gene expression is measured. The amount of 
time to effect such interactions may be empirically determined, such as by running a 
time course and measuring the level of transcription as a function of time. The amount 
of transcription may be measured by using any method known to those of skill in the art 
to be suitable. For example, mRNA expression of the protein of interest may be 

25 detected using northern blots or their polypeptide products may be identified using 
immunoassays. Alternatively, transcription based assays using reporter gene may be 
used as described in U.S. Patent 5,436,128, herein incorporated by reference. The 
reporter genes can be, e.g., chloramphenicol acetyltransferase, luciferase, 
beta-galactosidase beta-lactamase and alkaline phosphatase. Furthermore, the protein 

30 of interest can be used as an indirect reporter via attachment to a second reporter such 
as green fluorescent protein {see, e.g., Mistili & Spector, Nature Biotechnology, 
15:961-964(1997)). 
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The amount of transcription is then compared to the amount of transcription in 
either the same cell in the absence of the test compound, or it may be compared with 
the amount of transcription in a substantially identical cell that lacks the T1R 
polypeptide(s) of interest. A substantially identical cell may be derived from the same 
5 cells from which the recombinant cell was prepared but which had not been modified 
by introduction of heterologous DNA. Any difference in the amount of transcription 
indicates that the test compound has in some manner altered the activity of the T1R 
polypeptides of interest. 

10 4. Transgenic non-human animals expressing chemosensory receptors 

Non-human animals expressing a combination of T1R taste receptor sequences 
of the invention can also be used for receptor assays. Such expression can be used to 
determine whether a test compound specifically binds to a mammalian taste 
transmembrane receptor complex in vivo by contacting a non-human animal stably or 
1 5 transiently transfected with nucleic acids encoding chemosensory receptors or ligand- 
binding regions thereof with a test compound and determining whether the animal 
reacts to the test compound by specifically binding to the receptor polypeptide 
complex. 

Animals transfected or infected with the vectors of the invention are particularly 
20 useful for assays to identify and characterize taste stimuli that can bind to a specific or 
sets of receptors. Such vector-infected animals expressing human taste receptor 
sequences can be used for in vivo screening of taste stimuli and their effect on, e.g., cell 
physiology (e.g., on taste neurons), on the CNS, or behavior. Alternatively, stable cell 
lines that express a T1R or combination thereof, can be used as nucleic transfer donors 
25 to produced cloned transgenic animals that stably express a particular T1R or 

combination. Methods of using nucleic transfer to produce cloned animals that express 
a desired heterologous DNA are the subject of several issued U.S . patents granted to the 
University of Massachusetts (licensed to Advanced Cell Technology, Inc.) andRoslin 
Institute (licensed to Geron Corp.). 
30 Means to infect/express the nucleic acids and vectors, either individually or as 

libraries, are well known in the art. A variety of individual cell, organ, or whole animal 
parameters can be measured by a variety of means. The T1R sequences of the 
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invention can be for example co-expressed in animal taste tissues by delivery with an 
infecting agent, e.g., adenovirus expression vector. 

The endogenous taste receptor genes can remain functional and wild-type 
(native) activity can still be present. In other situations, where it is desirable that all 
5 taste receptor activity is by the introduced exogenous hybrid receptor, use of a 
knockout line is preferred. Methods for the constitutive of non-human transgenic 
animals, particularly transgenic mice, and the selection and preparation of recombinant 
constructs for generating transformed cells are well known in the art. 

Constitutive of a "knockouf * cell and animal is based on the premise that the 

10 level of expression of a particular gene in a mammalian cell can be decreased or 

completely abrogated by introducing into the genome a new DNA sequence that serves 
to interrupt some portion of the DNA sequence of the gene to be suppressed. Also, 
"gene trap insertion" can be used to disrupt a host gene, and mouse embryonic stem 
(ES) cells can be used to produce knockout transgenic animals (see, e.g 9 Holzschu, 

15 Transgenic Res 6:97-106 (1997)). The insertion of the exogenous is typically by 
homologous recombination between complementary nucleic acid sequences. The 
exogenous sequence is some portion of the target gene to be modified, such as exonic, 
intronic or transcriptional regulatory sequences, or any genomic sequence which is able 
to affect the level of the target gene's expression; or a combination thereof. Gene 

20 targeting via homologous recombination in pluripotential embryonic stem cells allows 
one to modify precisely the genomic sequence of interest. Any technique can be used, 
to create, screen for, propagate, a knockout animal, e.g., see Bijvoet, Hum. Mol Genet. 
7:53-62 (1998); Moreadith, J. Mol. Med. 75:208-216 (1997); Tojo, Cytotechnology 
19:161-165 (1995); Mudgett, Methods Mol Biol 48:167-184 (1995); Longo, 

25 Transgenic Res. 6:321-328 (1997); U.S. Patents Nos. 5,616,491; 5,464,764; 5,631,153; 
5,487,992; 5,627,059; 5,272,071; WO 91/09955; WO93/09222; WO 96/2941 1; WO 
95/31560; WO 91/12650. 

The nucleic acids of the invention can also be used as reagents to produce 
<e knockout" human cells and their progeny. Likewise, the nucleic acids of the invention 

30 can also be used as reagents to produce "knock-ins" in mice. The human or rat T1R 
gene sequences can replace the orthologous T1R in the mouse genome. In this way, a 
mouse expressing a human or rat T1R is produced. This mouse can then be used to 
analyze the function of human or rat TIRs, and to identify ligands for such TIRs. 
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a. Modulators 

The compounds tested as modulators of a T1R family member can be any small 
chemical compound, or a biological entity, such as a protein, nucleic acid or lipid. 
Examples thereof include 5 1 IMP and 5 l GMP. Essentially any chemical compound 
5 can be used as a potential modulator or ligand in the assays of the invention, although 
most often compounds that are soluble in aqueous solutions are tested. Assays can be 
designed to screen large chemical libraries by automating the assay steps and providing 
compounds from any convenient source; these assays are typically run in parallel (e.g., 
in microtiter formats on microtiter plates in robotic assays). It will be appreciated that 
1 0 chemical libraries can be synthesized by one of many chemical reactions (e.g. Senomyx 
proprietary chemistries). Additionally, there are many suppliers of chemical 
compounds, including Sigma (St. Louis, MO), Aldrich (St. Louis, MO), Sigma-Aldrich 
(St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs, Switzerland) and the 
like. 

1 5 In one preferred embodiment, high throughput screening methods involve 

providing a combinatorial chemical or peptide library containing a large number of 
potential taste affecting compounds (potential modulator or ligand compounds). Such 
"combinatorial chemical libraries" or "ligand libraries" are then screened in one or 
more assays, as described herein, to identify those library members (particular chemical 

20 species or subclasses) that display a desired characteristic activity. The compounds 
thus identified can serve as conventional "lead compounds" or can themselves be used 
as potential or actual taste modulators. 

Preferably, such libraries will be screened against cells or cell lines that stably 
express a T1R or combination of TIRs, i.e. T1R1/T1R3 or T1R2/T1R3 and preferably 

25 a suitable G protein, e.g. G a i5- As shown in the examples infra, such stable cell lines 
exhibit very pronounced responses to taste stimuli, e.g. umami or sweet taste stimuli. 
However, cells and cell lines that transiently express one or more TIRs may also be 
used in such assays. 

A combinatorial chemical library is a collection of diverse chemical compounds 
30 generated by either chemical synthesis or biological synthesis, by combining a number 
of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library such as a polypeptide library is formed by combining a set of chemical 
building blocks (amino acids) in every possible way for a given compound length (i.e., 



WO 2005/015158 



PCT/US2004/025459 



the number of amino acids in a polypeptide compound). Thousands to millions of 
chemical compounds can be synthesized through such combinatorial mixing of 
chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well known to 
5 those of skill in the art. Such combinatorial chemical libraries include, but are not 
limited to, peptide libraries {see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. 
Res., 37:487-493 (1991) and Houghton et al, Nature, 354:84-88 (1991)). Other 
chemistries for generating chemically diverse libraries can also be used. Such 
chemistries include, but are not limited to: peptoids {e.g., PCT Publication No. WO 

10 91/19735), encoded peptides {e.g., PCT Publication WO 93/20242), random 

bio-oligomers {e.g., PCT Publication No. WO 92/00091), benzodiazepines {e.g., U.S. 
Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides 
(Hobbs et al, Proc. Nat. Acad. Set , 90:6909-6913 (1993)), vinylogous polypeptides 
(Hagihara et al, J. Amer. Chem. Soc, 1 14:6568 (1992)), nonpeptidal peptidomimetics 

15 with glucose scaffolding (Hirschmann et al, J. Amer. Chem. Soc, 1 14:9217-9218 

(1992)), analogous organic syntheses of small compound libraries (Chen et al, J. Amer. ;.. 
Chem. Soc, 1 16:2661 (1994)), oligocarbamates (Cho et al, Science, 261:1303 (1993)), . 
peptidyl phosphonates (Campbell et al, J. Org. Chem., 59:658 (1994)), nucleic acid 
libraries (Ausubel, Berger and Sambrook, all supra), peptide nucleic acid libraries (U.S. 

20 Patent 5,539,083), antibody libraries (Vaughn et al, Nature Biotechnology, 

14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (Liang et al, 
Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic molecule 
libraries (benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); thiazolidinones and 
metathiazanones, U.S. Patent 5,549,974; pyrrolidines, U.S. Patents 5,525,735 and 

25 5,519,134; morpholino compounds, U.S. Patent 5,506,337; benzodiazepines, 5,288,514, 
and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available {see, e.g., 357 MPS, 390 MPS (Advanced Chem Tech, Louisville KY), 
Symphony (Rainin, Woburn, MA), 433A (Applied Biosystems, Foster City, CA), 9050 
30 Plus (Millipore, Bedford, MA)). In addition, numerous combinatorial libraries are 
themselves commercially available {see, e.g., ComGenex, Princeton, NJ; Tripos, Lie, 
St. Louis, MO; 3D Pharmaceuticals, Exton, PA; Martek Biosciences; Columbia, MD; 
etc.). 
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In one aspect of the invention, the T1R modulators can be used in any food 
product, confectionery, pharmaceutical composition, or ingredient thereof to thereby 
modulate the taste of the product, composition, or ingredient in a desired manner. For 
instance, T1R modulators that enhance sweet taste sensation can be added to sweeten a 
5 product or composition; T1R modulators that enhance umami taste sensation can be 
added to foods to increase savory tastes. Alternatively, T1R antagonists can be used to 
block sweet and/or umami taste. 

b. Kits 

1 0 T1R genes and their homologs are useful tools for identifying chemosensory 

receptor cells, for forensics and paternity determinations, and for examining taste 
transduction. T1R family member-specific reagents that specifically hybridize to T1R 
nucleic acids, such as T1R probes and primers, and T1R specific reagents that 
specifically bind to a T1R polypeptide, e.g. , T1R antibodies are used to examine taste 

1 5 cell expression and taste transduction regulation. 

Nucleic acid assays for the presence of DNA and RNA for a T1R family 
member in a sample include numerous techniques are known to those skilled in the art, 
such as southern analysis, northern analysis, dot blots, RNase protection, SI analysis, 
amplification techniques such as PCR, and in situ hybridization. In in situ 

20 hybridization, for example, the target nucleic acid is liberated from its cellular 
surroundings in such as to be available for hybridization within the cell while 
preserving the cellular morphology for subsequent interpretation and analysis. The 
following articles provide an overview of the art of in situ hybridization: Singer et al, 
Biotechniques, 4:230250 (1986); Haase et al, Methods in Virology, vol. VII, pp. 

25 189-226 (1984); and Nucleic Acid Hybridization: A Practical Approach (Names et al 9 
eds. 1987). In addition, a T1R polypeptide can be detected with the various 
immunoassay techniques described above. The test sample is typically compared to 
both a positive control (e.g., a sample expressing a recombinant T1R polypeptide) and a 
negative control. 

30 The present invention also provides for kits for screening for modulators of T1R 

family members. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise any one or more of the following 
materials: T1R nucleic acids or proteins, reaction tubes, and instructions for testing 
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T1R activity. Optionally, the kit contains a biologically active T1R receptor or cell line 
that stably or transiently expresses a biologically active T1R containing taste receptor. 
A wide variety of kits and components can be prepared according to the present 
invention, depending upon the intended user of the kit and the particular needs of the 
5 user. 

EXAMPLES 

While the invention has been described in detail supra, the following examples 
are provided to illustrate preferred embodiments. These examples are intended to be 
10 illustrative and not limitative of the scope of the invention. 

In the protein sequences presented herein, the one-letter code X or Xaa refers to 
any of the twenty common amino acid residues. In the DNA sequences presented 

herein, the one letter codes N or n refers to any of the of the four common nucleotide 
bases, A,T, C,orG. 

15 

Example 1 

Production of In tronless hTIR Expression Constructs 

. Intronless hTIR expression constructs were cloned by a combination of cDNA- 
based and genomic DNA-based methods. To generate the full-length hTlRl 

20 expression construct, two 5* coding exons identified in a cloned hTlRl interval 
(accession # AL159177) were combined by PCR-overlap, and then joined to a 5'- 
truncated testis cDNA clone. The hTlR2 expression construct was generated from a 
partially sequenced hTlR2 genomic interval. Two missing hTlR2 5* exons were 
identified by screening shotgun libraries of the cloned genomic interval using probes 

25 derived from the corresponding rat coding sequence. Coding exons were then 

combined by PCR-overlap to produce the full-length expression construct. The hTlR3 
expression construct was generated by PCR-overlap from a sequenced hTlR3 genomic 
interval (accession # AL139287). Rat T1R3 was isolated from a rat taste tissue-derived 
cDNA library using an rTlR3 exon fragment generated by hTlR3-based degenerate 

30 PCR. The partial hTlRl cDNA, rTlR2 cDNA, and partial hTlR2 genomic sequences 
were obtained from Dr. Charles Zulcer (University of California, San Diego). 
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The nucleic acid and amino acid sequences for the above-identified T1R cloned 
sequences as well as other full-length and partial T1R sequences are set forth in the 
sequence listing. 

Also, the following conceptual translations, which correspond to the C-tennini 
5 of two orthologous pairs offish TIRs, are derived from unpublished genomic sequence 

fragments and provided. Fugu T1RA was derived from accession 'scaffold 164'; Fugu 

T1RB was derived from accession LPC6171 1; Tetradon T1RA was derived from 

accession AL226735; Tetradon T1RB was derived from accession AL222381. 

Ambiguities in the conceptual translations ('X 1 ) result from ambiguities in database 
1 0 sequences. These sequences can be found in the sequence listing. 

Additionally, the accession number and reference citations relating to mouse 

and rat TIRs and allelic variants thereof in the public domain are is set forth below: 

rTlRl (Accession # AAD18069) (Hoon et al., Cell 96 (4): 541-51 (1999)); 

rTlR2 (Accession # AAD18070) (Hoon et al., Cell 96(4): 541-59 (1999)); 
1 5 mTlRl (Accession # AAK39437); mTlR2 (Accession #AAK 39438); 

mTlR3 (Accession AAK 55537) (Max et al., Nat. Genet. 28(1): 58-63 (2001)); 

rTlRl (Accession # AAK7092) (Li et al., Mamm. Genome (12(1): 13-16 (2001)); 

mTlRl (Accession # NP 1 14073); mTlRl (Accession # AAK07091) (Li et al., Mamm. 

Genome (121):13-16 (2001)); rTlR2 (Accession # AAD18070) (Hoon et al., Cell 
20 9664): 541-551 (1999)); mTlR2 (Accession # NP114079); mTlR3 (Accession # 

AAK39436); mTlR3 (Accession # BAB47181); (Kitagawa et al., Biochem. Biophys. 

Res. Comm. 283(l):236-42 (2001)); mTlR3 (Accession #NP1 14078); mTlR3 . 

(Accession # AAK55536) (Max et al, Nat. Genet. 28(l):58-63 (2001)); and mTlR3 

(Accession No. AAK01937). 

25 

Example 2 

Sequence Alignment of Human and Rat TIRs 

Cloneld T1R sequences selected from those identified above were aligned 
against the corresponding rat TIRs. As shown in Figure 1, human T1R1, human T1R2 
30 and human T1R3 and rat T1R3 were aligned with previously described TIRs (rTlRl 
having Accession # AAD18069 and rTlR2 having Accession # AAD18070), the rat 
mGluRl metabotropic, glutamate receptor (Accession # P23385); and the human 
calcium-sensing receptor (Accession #P41 180). For clarity of the comparison, the 
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mGluRl and calcium-sensing receptor C-termini are truncated. The seven potential 
transmembrane segments are boxed in blue. Residues that contact the glutamate side- 
chain carbutylate in the mGluRl crystal structure are boxed in red, and residues that 
contact the glutamate a-amino acid moiety are boxed in green. The mGluRl and 
5 calcium-sensing receptor cysteine residues implicated in intersubunit disulfide-based 
formation are circled in purple. These cysteines are not conserved in T1R1 and T1R2, 
but are located in a degraded region of the alignment that contains a potentially 
analogous T1R3 cysteine residue, also circled. 

10 Example 3 

Demonstration by RT-PCR that hTlR2 and hTlR3 are expressed in taste tissue 

As shown in Figure 2, hTlR2 and hTlR3 are expressed in taste tissue: 
expression of both genes can be detected by RT-PCR from resected human 
circumvallate papillae. 

15 

Example 4 

Methods for Heterologous Expression of TIRs in Heterologous Cells 

AnHEK-293 derivative (Chandrashekar et al., Cell 100(6): 703-11 (2000)), 
which stably expresses Gal 5, was grown and maintained at 37°C in Dulbecco's 

20 Modified Eagle Medium (DMEM, Gibco BRL) supplemented with 10% FBS, MEM 
non-essential amino acids (Gibco BRL), and 3 (ig/ml blasticidin. For calcium-imaging 
experiments, cells were first seeded onto 24-well tissue-culture plates (approximately 
0.1 million cells per well), and transfected by lipofection with Minis TransIt-293 
(PanVera). To minimize glutamate-induced and glucose-induced desensitization, 

25 supplemented DMEM was replaced with low-glucose DMEM/GlutaMAX (Gibco 

BRL) approximately 24 hours after transfection. 24 hours later, cells were loaded with 
the calcium dye Fluo-4 (Molecular Probes), 3fiM in Dulbecco's PBS buffer (DPBS, 
GibcoBRL), for 1.5 hours at room temperature. After replacement with 250fil DPBS, 
stimulation was performed at room temperature by addition of 200^1 DPBS 

30 supplemented with taste stimuli. Calcium mobilization was monitored on a Axiovert 
SI 00 TV microscope (Zeiss) using Imaging Workbench 4.0 software (Axon). 
T1R1/T1R3 and TlR2/TlR3 responses were strikingly transient - calcium increases 
rarely persisted longer than 15 seconds - and asynchronous. The number of responding 
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cells was thus relatively constant over time; therefore, cell responses were quantitated 
by manually counting the number of responding cells at a fixed time point, typically 30 
seconds after stimulus addition. 

5 Example 5 

Human T1R2/T1R3 functions as a sweet taste receptor 

HEK cells stably expressing Gal 5 were transiently transfected with human 
T1R2, T1R3 and T1R2/T1R3, and assayed for increases in intracellular calcium in 
response to increasing concentrations of sucrose (Figure 3(a)). Also, T1R2/T1R3 dose 

10 responses were determined for several sweet taste stimuli (Figure 3(b)). The maximal 
percentage of responding cells was different for different sweeteners, ranging from 10- 
30%. For clarity, dose responses were normalized to the maximal percentage of 
responding cells. The values in Figure 3 represent the mean + s.e. of four independent 
responses. X-axis circles mark psychophysical detection thresholds determined by 

15 taste testing. Gurmarin (50-fold dilution of a filtered 10g/l Gymnema sylvestre aqueous 
extract) inhibited the response of T1R2/T1R3 to 250 mM sucrose, but not the response 
of endogenous p2-adrenergic receptor to 20 |xM isoproterenol (Figure 3(b)). Figure 
3(c) contains the normalized response of T1R2/T1R3 co-expressing cell lines to 
different sweeteners(sucrose, aspartame, D-tryptophan and saccharin). 
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Example 6 

Rat T1R2/T1R3 also functions as a sweet taste receptor 

HEK cells stably expressing Gal5 were transiently transfected with 
5 hTlR2/hTlR3, rTlR2/rTlR3, hTlR2/rTlR3, and rTlR2/hTlR3. These transfected 
cells were then assayed for increased intracellular calcium in response to 350 mM 
sucrose, 25 mM tryptophan, 15 mM aspartame, and 0.05% of monellin. The results 
with sucrose and aspartame are contained in Figure 4 and indicate that rTlR2/rTlR3 
also functions as a sweet taste receptor. Also, these results suggest that T1R2 may 
10 control T1R2/T1R3 ligand specificity. 

Example 7 

T1R2/T1R3 responses using an automated fluorescence based assay 
HEK cells stably expressing Gal 5 were transiently transfected with hTlR2 and 

1 5 hTlR3. These cells were loaded with the calcium dye Fluo-4, and their responses to a 
sweetener measured using a fluorescence plate reader. Figure 5 contains cyclamate 
(12.5 mM) responses for cells expressing hTlR2/hTlR3 and for cells expressing only 
hTlR3 (J19-22). The fluorescence results obtained indicate that responses to these 
taste stimuli only occurred in the cells expressing hTlR2/hTlR3. Figure 6 contains 

20 normalized dose-response curves, the results of which show that hTlR2 and hTlR3 
function together as a human taste receptor based on their dose-specific interaction with 
various sweet stimuli . Particularly, Figure 6 contains dose-responses for sucrose, 
tryptophan and various other commercially available sweeteners. These results indicate 
that T1R2/T1R3 is a human sweet taste receptor as the rank order and threshold values 

25 obtained in the assay closely mirror values for human sweet taste. 
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Example 8 

Ligand-binding residues of mGluRl are conserved in T1R1 

As shown in Figure 6, the key ligand-binding residues of mGluRl are 
5 conserved in T1R1 . The interaction of glutamate with mGluRl is shown with several 
key residues highlighted according to the same color scheme as Figure L 

Example 9 

Human T1R1/T1R3 functions as umami taste receptors 

1 0 HEK cells stably expressing Gal 5 were transiently transfected with human 

T1R1, T1R3 and T1R1/T1R3 and assayed for increases in intracellular calcium in 
response to increasing concentrations of glutamate (Figure 8(a)), and 0.5 mM 
glutamate), 0.2 mM IMP, and 0.5 mM glutamate plus 0.2 mM IMP (Figure 8(b)). 
Human T1R1/T1R3 dose responses were determined for glutamate in the presence and 

1 5 absence of 0.2 mM IMP (Figure 8(c)). The maximal percentages of responding cells 
was approximately 5% for glutamate and approximately 10% for glutamate plus IMP. 
For clarity, does responses are normalized to the maximal percentage of responding 
cells. The values represent the mean + s.e. of four independent responses. X-axis 
circles mark taste detection thresholds determined by taste testing. 

20 

Example 10 
PDZIP as an Export Sequence 

The six residue PDZIP sequence (SVSTW (SEQ ID NO:l)) was fused to the C- 
terminus of hTlR2 and the chimeric receptor (i.e. hTlR2-PDZDP) was transfected into 

25 an HEK-293 host cell. The surface expression of hTlR2 was then monitored using 
immunofluorescence and FACS scanning data. As shown in Figures 9A and 9B, the 
inclusion of the PDZIP sequence increased the surface expression of hTlR2-PDZIP 
relative to hTlR2. More specifically, Figure 9A shows an immunofluorescence 
staining of myc-tagged hTlR2 demonstrating that PDZIP significantly increases the 

30 amount of hTlR2 protein on the plasma membrane. Figure 9B shows FACS analysis 
data demonstrating the same result- Cells expressing myc-tagged hT!R2 are indicated 
by the dotted line and cells expressing myc-tagged hTlR2-PDZIP are indicated by the 
solid line. Particularly, Figure 10A shows untransfected Gal 5 stable host cells in HBS 
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buffer, Figure 10B shows hTlR2-PDZIP transfected Gal 5 stable hose cells in 
sweetener pool no. 5 (saccharin, sodium cyclamate, Acesulfame K, and Aspartame-20 
mM each in HBS buffer), Figure 10C shows T1R3-PDZIP transfected Gal5 stable host 
cells in sweetener pool no. 5, and Figure 10D shows hTlR2-PDZIP/hTlR3-PDZIP co- 
5 transfected Gal5 stable host cells in sweetener pool no. 5. Further, Figures 10E-10H 
show dose-dependent response of hTlR2/hTlR3 co-transfected Gal5 stable host cells 
to sucrose-E: OmM in HBS buffer; F: 30 mM; G: 60 mM; and H: 250 mM. Figures 
10I-10L shown the responses of hTlR2/hTlR3 co-transfected Gal5 stable host cells to 
individual sweeteners - 1: Aspartame (1.5 mM); J: Acesulfame K (1 mM); K: Neotame 
10 (20mM); L: Sodium cyclamate (20mM). As demonstrated by the calcium-images of 
Figure 10, hTlR2 and hTlR3 are both required for the activities triggered by the sweet 
stimuli. 

Example 11 

15 Generation of Cell Lines that Stably Co-Express T1R1/T1R3 or T1R2/T1R3 

Human cell lines that stably co-express human T1R2/T1R3 or human 
T1R1/T1R3 were generated by transfecting linearized PEAKlO-derived (Edge 
Biosystems) vectors and pCDNA 3.1/ZEO-derived (Invitrogen) vectors respectively 
containing hTlRl or hTlR2 expression construct (plasmid SAV2485 for T1R1, 

20 SAV2486 for T1R2) and hTlR3 (plasmid SXV550 for T1R3) into a G a i 5 expressing 
cell line. Specifically, T1R2/T1R3 stable cell lines were produced by co-transfecting 
linearized SAV2486 and SXV550 into Aurora Bioscience's HEK-293 cell line that 
stably expresses G a i5. T1R1/T1R3 stable cell lines were produced by co-transfecting 
linearized SAV2485 and SXV550 into the same HEK-293 cell line that stably 

25 expresses G a is. Following SAV2485/SXV550 and SAV2486/SXV550 transfections, 
puromycin-resistant and zeocin-resistant colonies were selected, expanded, and tested 
by calcium imaging for responses to sweet or umami taste stimuli. Cells were selected 
in 0.0005 mg/ml puromycin (CALBIOCHEM) and 0.1 mg/ml zeocin (Invitrogen) at 
37°C in low-glucose DMEM supplemented with GlutaMAX, 10% dialyzed FBS, and 

30 0.003 mg/ml blasticidin. Resistant colonies were expanded, and their responses to 

sweet taste stimuli evaluated by Fluorescence microscopy. For automated fluorimetric 
imaging on VIPR-II instrumentation (Aurora Biosciences), T1R2/T1R3 stable cells 
were first seeded onto 96-well plates (approximately 100,000 cells per well). Twenty- 
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four hours later, cells were loaded with the calcium dye fluo-3-AM (Molecular Probes), 
0.005 mM in PBS, for one hour at room temperature. After replacement with 70 \il 
PBS, stimulation was performed at room temperature by addition of 70 id PBS 
supplemented with taste stimuli. Fluorescence (480 nm excitation and 535 nm 
5 emission) responses from 20 to 30 seconds following compound addition were 

averaged, corrected for background fluorescence measured prior to compound addition, 
and normalized to the response to 0.001 mM ionomycin (CALBIOCHEM), a calcium 
ionophore. 

It was then observed that when these cell lines were exposed to sweet or umami 

10 stimuli, that for active clones typically 80-100% of cells responded to taste stimuli. 

Unexpectedly, the magnitude of individual cell responses was markedly larger than that 
of transiently transfected cells. 

Based on this observation, the inventors tested the activity of T1R stable cell 
lines by automated fluorescence imaging using Aurora Bioscience's VTPR 

1 5 instrumentation as described above. The responses of two T1R1/T1R3 and one 
T1R2/T1R3 cell line are shown in Figure 1 1 and Figure 12 respectively. 

Remarkably, the combination of increased numbers of responding cells and 
increased response magnitudes resulted in a greater than 10-fold increase in activity 
relative to transiently transfected cells. (By way of comparison, the percent ionomycin 

20 response for cells transiently transfected with T1R2/T1R3 was approximately 5% under 
optimal conditions.) Moreover, dose responses obtained for stably expressed human 
T1R2/T1R3 and T1R1/T1R3 correlated with human taste detection thresholds. The 
robust T1R activity of these stable cell lines suggests that they are well suited for use in 
high-throughput screening of chemical libraries in order to identify compounds, e.g. 

25 small molecules, that modulate the sweet or umami taste receptor and which therefore 
modulate, enhance, block or mimic sweet or umami taste. 

Example 12 

Generation of cell lines that inducibly co-express T1R1/T1R3 which selectively 
30 respond to umami taste stimuli 

T1R1/T1R3 HEK 293 cell lines that stably expressed the umami taste receptor 
display robust improved activity relative to transiently tranfected cites. However, a 
disadvantage is that they can rapidly lose activity during cell propagation. 
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Also, these findings show that (i) T1R1/T1R3 is a umami taste receptor, i.e., 
and (ii) that cell lines which robustly express T1R1/T1R3, preferably stable and/or 
inducible T1R1/T1R3 cell lines can be used in assays, preferably for high throughput 
screening of chemical libraries to identify novel modulators of umami taste. 
5 Modulators that enhance umami taste may be used. 

To overcome the instability of the T1R1/T1R3 stable cell lines, the HEK-G a i 5 
cells have been engineered to inducibly express T1R1/T1R3 using the GeneSwitch 
system (Invitrogen). pGene-derived zeocin-resistant expression vectors for human 
T1R1 and T1R3 (plasmid SXV603 for T1R1 and SXV611 for T1R3) and a puromycin- 

1 0 resistant pSwitch-derived vector that carries the GeneSwitch protein (plasmid SXV628) 
were linearized and cotransfected into the HEK-G a i5 cell line. Zeocin-resistant and 
puromycin-resistant colonies were selected, expanded, induced with variable amounts 
of mifepristone, and tested by calcium imaging for responses to umami taste stimuli. 
Inducible expression of T1R1/T1R3 resulted in robust activity. For example, 

15 approximately 80% of induced cells but only approximately 10% of transiently 

transfected cells responded to L-glutamate; More specifically, pGene derived Zeocin- 
resistant expression vectors that express human T1R1 and human T1R3 and a 
puromycn-resistant pSwitch-derived vector that carries the GeneSwitch protein were 
linearized and co-tranfected into G a is cells. Cells were selected in 0.5 jig/ml 

20 puromycin (CAL BIOCHEM) and 100 ng/ml Zeocin (Invitrogen) at 37°C in 

Dulbecco's Modified Eagle Medium supplemented with GlutaMAX, (10 % dialyzed 
FBS, and 3 ug/ml blasticidin. Resistant colonies were expanded, and their responses to 
umami taste stimuli following induction with 10" I0 M mifepristone determined by 
fluorescence microscopy following the methods of Li et al, PNAS 99(7): 4692-4696 

25 (2002). 

For automated fluorometric imaging on FLIPR instrumentation (Molecular 
Device), cells from one clone (designated clone 1-17) were seeded into 96-well plates 
(approximately 80,000 cell per well) in the presence of 10" 10 M mifepristone and 
incubated for 48 hours. Cells were then loaded with the calcium dye fluo-4-AM 
30 (Molecular Probes), 3 jiM in PBS, for 1.5 hours at room temperature. 

After replacement with 50 nl PBS, stimulation was performed at room 
temperature by the addition of 50 \il PBS supplemented with different stimuli. In 
contrast to previous transient T1R1/T1R3 umami receptor expression systems that 
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necessitated quantifying T1R1/T1R3 receptor activity by individually counting 
responding cells (Li et al., PNAS 99(7): 4692-4696 (2002)) (because of the low activity 
of the receptor therein), the subject inducible expression system resulted in a clone 1-17 
having substantially increased activity that allowed receptor activity to be quantified by 
5 determining maximal fluorescence increases (480 nm excitation and 535 nm emission) 
summated over fields of imaged cells. The maximal fluorescence from four 
independent determinations was averaged, corrected for background fluorescence 
measured prior to compound addition, and normalized to the response to 0.002 mM 
ionomycin (CALBIOCHEM). 

10 These results are contained in Figure 13. Particularly, Figure 13 contains a 

dose-response curve determined for L-glutamate in the presence and absence of 0.2 
mM IMP. In the figure, each value represents average summated maximal fluorescence 
(corrected for background fluorescence) for four independent determinations. These 
dose-response curves correspond to those determined for cells transiently transfected 

15 withTlRl/TlR3. 

The selectivity of the umami T1R1/T1R3 taste receptor was also evaluated by 
screening with different L-amino acids. The results obtained indicated that T1R1/T1R3 
is selectively activated by the umami-tasting L-amino acids (L-glutamate and L- ; 
aspartate). 

20 The results of experiments wherein the responses of the 1-17 clone was resulted 

in tested in the presence of different L-amino acids are contained in Figure 14 and 
Figure 15. Figure 14 shows the results of an experiment wherein the 1-17 cell line was 
contacted with different L-amino acids at a concentration of lOmM in the presence and 
absence of ImM IMP. 

25 Figure 15 contains a dose-response curve for active amino acids determined in 

the presence of 0.2mM IMP. Each value represents the average of four independent 
determinations. 

The results obtained in these experiments support the specificity and selectivity 
of the umami taste receptor to umami taste stimuli. Whereas the umami taste stimuli L- 
30 glutamate and L-aspartate significantly activated the T1R1/T1R3 receptor at different 
concentrations (see Figure 14 and 15), the other L-amino acids which activated the 
human T1R1/T1R3 receptor only activated the receptor weakly and at much higher 
concentrations. 
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Therefore, these results support the selectivity of the T1R1/T1R3 receptor for 
umami taste stimuli and the suitability of this inducible stable expression system for use 
in high throughput screening assays using automated fluorometric imaging 
instrumentation to identify compounds that activate the umami taste receptor, for 
5 example L-glutamate or L-aspartate , or which enhance the activity of L-glutamate to 
activate the umami taste receptor, for example 5'-IMP or 5'-GMP, or block the 
activation of the umami taste receptor by umami taste stimuli such as L-glutamate and 
L-aspartate. 

Compounds identified using these assays have potential application as 
10 flavorants in foods and beverage compositions for mimicing or blocking umami taste 
stimuli. 

Example 13 

Lactisole Inhibits the Receptor Activities of Human T1R2/T1R3 and T1R1/T1R3 , 
15 and Sweet and Umami Taste 

Lactisole, an aralkyl carboxylic acid, was thought to be a selective sweet-taste 
inhibitor (See e.g., Lindley (1986) U.S. Patent 4,567,053; and Schiffinan et al. Chem 
Senses 24:439-447 (1999)). Responses of HEK-G a i5 cells transiently transfected with 
T1R2/T1R3 to 1 50 mM sucrose in the presence of variable concentrations of lactisole 
20 were measured. Lactisole inhibits the activity of human T1R2/T1R3 with an IC50 of 24 

The T1R1/T1R3 umami and T1R2/T1R3 sweet taste receptor may share a 
common subunit. It has therefore been theorized that lactisole, which inhibit the 
T1R2/T1R3 sweet taste receptor, may have a similar effect on the T1R1/T1R3 umami 

25 taste receptor. The present inventors tested the effect of lactisole on the response of 
human T1R1/T1R3 to lOmM L-Glutamate. As with the T1R2/T1R3 sweet receptor, 
lactisole inhibited T1R1/T1R3 with an IC50 of 165 pM. Lactisole inhibition likely 
reflects antagonism at the T1R receptors instead of, for example, non-specific 
inhibition of G a i5-mediated signaling because the response of muscarinic acetylcholine 

30 receptors was not inhibited by lactisole. 

The present inventors then evaluated the effect of lactisole on human umami 
taste. Taste thresholds in the presence of 1 and 2 mM lactisole were determined for the 
umami taste stimuli L-Glutamate with or without 0.2 mM IMP, the sweet taste stimuli 
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sucrose and D-tryptophan, and the salty taste stimulus sodium chloride following the 
methods of Schiffinan et al. (Chem. Senses 24: 439-447 (1989)). Millimolar 
concentrations of lactisole dramatically increased detection thresholds for sweet and 
umami but not salt taste stimuli. These results are contained in Figure 16. 
5 In conclusion, (i) these findings further support the inventors' hypothesis that 

T1R1/T1R3 is the only umami taste receptor, and (ii) the T1R1/T1R3 and T1R2/T1R3 
receptors may share a structurally related lactisole-binding domain. 

While the foregoing detailed description has described several embodiments of 
the present invention, it is to be understood that the above description is illustrative 
10 only and not limiting of the disclosed invention. The invention is to be limited only by 
the claims which follow. 

Example 14 

Mapping of Ligand Interaction Sites on the Sweet Receptor 

15 

Through coexpression of T1R2R-H with human T1R3, part of the human sweet 
receptor (the N-terminal domain of T1R2) was replaced with rat protein sequence. The 
responses to aspartame and neotame are abolished, showing that the N- terminal 
domain of human T1R2 is required for recognizing aspartame and neotame. Similarly, 

20 the rat T1R2 N-terminal domain was also replaced with human protein sequence by 
coexpressing T1R2H-R with rat T1R3. The chimeric receptor gains the ability to 
respond to aspartame and neotame, suggesting that the same domain of human T1R2 is 
also sufficient (in the context of sweet receptors) to recognize those two sweeteners 
(Fig. 22B). These in vitro functional expression data indicate that the important 

25 interaction determinants are located in the N-terminal extracellular domain. 

In contrast, replacing either half of human T1R2 with rat protein sequence 
does not affect its response to cyclamate. Instead, the C-terminal domain of human 
T1R3 is required and sufficient, when co-expressed with T1R2, to recognize cyclamate 
(Fig. 22C). The transmembrane domain of family C GPCRs has been known to 

30 contain binding sites for allosteric modulators (Gasparini, F., R. Kuhn, and J.P. Pin, 
Curr Opin Pharmacol 2002 Feb;2(l):43-9). This is the first case in family C GPCR, 
where an agonist binds directly to the transmembrane domain and activates the 
receptor in the absence of other ligand. . 
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Lactisole, an aralkyl carboyxlic acid, is a specific human sweet taste 
inhibitor, which has physiological effect on the rodent taste. Consistent with the taste 
effect, lactisole inhibits the human but not rat T1R2/T1R3 response to sucrose in our 
assay system (Fig. 22A). The same kind of mapping experiments on lactisdl 
5 interaction site using the T1R chimeras was performed. Like cyclamate, lactisole 
requires the human T1R3 C-terminal domain to inhibit the receptor's response to 
sucrose and acesuifame K 

(Fig. 22D). This result further demonstrates the importance of T1R3 C-terminal 
domain in the sweet receptor function. The chimeras in all 16 possible combinations 
10 were tested, and all functional combinations generated results consistent with our 
model. 

Mutagenesis studies were conducted on both T1R2 and T1R3 to narrow down 
the essential amino acids in recognition of aspartame, neotame, and cyclamate. If 
T1R2 and T1R3 are responsible for recognizing different sweeteners, mutations in 

1 5 T1R2 N-terminal domain would affect responses to aspartame and neotame, but not 
cyclamate. In addition, mutations in T1R3 C-terminal domain would have the opposite 
effect. To select the crucial amino acid residues in the T1R2 N-terminal domain, the 
sequence of T1R2 was aligned with mGluRl (Fig. 23A). Among the eight residues 
that are crucial in ligand binding in mGluRl (Kunishima, N., et al., Nature, 2000. 

20 407(6807): p. 971-7), three are conserved in human T1R2 (S144, Y218, and E302). 

Each of the three residues were mutated and the resulting receptors were tested for 

their response to different sweeteners. Substitution of Y2 18 to A abolished the 

responses to all sweeteners tested, showing Y218 is important for the overall 

conformation of the receptor. The two other hTlR2 variants, containing S144A and 

25 E302A, selectively affected the response to aspartame and neotame but not cyclamate. 

■ 

Stable cell lines expressing S144A and E302A hTlR2 variants (coexpressed with wild 
type hTlR3 and G a \s) did not respond to aspartame or neotame at the physiological 
concentrations, but did respond to cyclamate (Fig. 23B). 

In order to further map the cyclamate-binding site, the three extracellular loops 
30 in the T1R3 C-terminal domain were focused on. Alignment of human and rodent 
TlR3s reveal multiple amino acid differences in the three extracellular loops (Fig. 
23C). Replacing extracellular loop-2 or loop-3 with rat sequences abolished the 
cyclamate response without affecting the sucrose or aspartame responses. In contrast, 
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replacing extracellular loop 1 had no obvious effect on response to cyclamate, showing 
an important role for EC loops 2 and 3 in recognizing cyclamate (Fig. 23D). None of 
those loop-replacements affected the inhibition effect of lactisole, showing a different 
binding mechanism. In summary, amino acid substitutions in T1R2 or T1R3 result in 
5 selective interference of activities induced by different sweeteners, consistent with the 
chimeric receptor results. 

The above results demonstrate that the human sweet receptor function as a 
heteromeric complex of T1R2 and T1R3. Both subunits are required for recognizing 
different sweeteners, and the data indicate the existence of multiple binding pockets 
10 on the receptor for different classes of agonists. The presence of multiple ligand- 
binding sites provides structural guidance and definition for the specifically binding 
compounds of the invention. 

Example 15 
Mapping of Receptor-G protein Interactions 

1 5 The human and rat sweet receptors are also different in their G protein-coupling 

efficiency. Even though both human and rat receptors can couple efficiently to Gaisnu 
only the human receptor can couple efficiently to Gais (Fig. 24A). This species 
difference allows for mapping of the receptor G protein interactions using the same 
chimeric receptors as described above. T1R2 but not T1R3 appears to be critical for 

20 G a i5-coupling, since replacing the C-terminus of human T1R2 with the corresponding 
rat sequence abolished coupling, and replacing rat T1R2 C-terminal half with human 
sequence enabled the receptor to couple to Gais and respond to sucrose and acesulfame 
K (Fig. 24); Swapping the T1R3 C-terminal sequences had no effect on Gois-coupling 
(Fig. 24B). This observation demonstrates the important role of T1R2 in G protein- 

25 coupling in the functional expression system. Gustducin (Wong, G.T., K.S. Gannon, 

and R.F. Margolskee, Nature, 1996. 381(6585): p. 796-800) has been proposed to be an 
endogenous G protein for the sweet taste receptor, and T1R2 can be the subunit 
responsible for in vivo coupling in taste cells. GABAbR is the other example of 
heteromeric family C GPCR, whereas one subunit (GABAbRI) is responsible for 

30 ligand- binding, and the other (GABAbR2) for G protein coupling (Margeta-Mitrovic, 
M., Paroc Natl Acad Sci U S A, 2001. 98(25): p. 14643-8; Margeta-Mitrovic, M., Proc 
Natl Acad Sci U S A, 2001. 98(25): p. 14649-54). The sweet receptor is different fiom 
GABAbR in that T1R2 is required for both ligand recognition and G- protein coupling. 
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Example 16 

Lactisole Antagonizes Human T1R1/T1R3 and Inhibits Human Umami 

Taste 

5 It was hypothesized that since T1R1/T1R3 function as heteromeric receptors 

as well as the sweet receptor, that lactisole should have similar effect on T1R1/T1R3 
activity, since T1R3 is a common subunit between the sweet and the umami receptors. 
Indeed, lactisole antagonized human T1R1/T1R3 (Fig. 25A). Lactisole acts as a 
noncompetitive inhibitor of T1R1/T1R3, since the IC50 values are apparently not 

10 dependent on glutamate concentration (Fig. 25B), and lactisole reduces the maximal 
activities of the receptor without significantly changing the EC50 of agonists (Fig. 
25C). These results demonstrate that lactisole binds to a different site from L- 
glutamate, and are consistent with the hypothesis that the glutamate-binding pocket is 
located in T1R1 . Lactisole appears to be a competitive inhibitor of the sweet receptor, 

15 as its ICsos are dependent on the concentrations of the sweeteners, and it increases the 
EC50S of the sweeteners without significantly affecting the maximal activities. 

The inhibition effect of lactisole is mediated by the T1R receptors since it had 
no effect on the endogenous muscarinic acetylcholine receptor in HEK cells or on a 
mouse bitter receptor, mT2R5, transiently expressed in HEK cells. As was the case 

20 for the T1R2/T1R3 receptor, lactisole inhibition of the T1R1/T1R3 response to 
umami taste stimuli was reversible following washout and restimulation. 

To correlate the receptor activity with behavior, the effect of lactisole on 
human umami taste was tested. As predicted, millimolar concentrations of lactisole 
dramatically increased detection thresholds for sweet and umami but not salt taste 

25 stimuli (Fig. 25D). Lactisole was previously not known to be an umami taste inhibitor. 
The correlation between receptor activity and taste results demonstrates a crucial role 
of TIRs in human umami taste. 

Example 17 

Cyclamate Enhances Human T1R1/T1R3 Receptor Activities 
30 Based on the same heteromeric model of TIRs (Fig. 26), it was predicted that 

cyclamate would also modulate the activity of the human T1R1/T1R3 umami receptor 
by acting on T1R3. Although cyclamate alone had no effect on T1R1/T1R3, it 
enhanced the activity of the receptor in the presence of L-glutamate (Fig. 27E). This 
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effect is specific for the human T1R1/T1R3, as cyclamate had no effect on the 
activities of the endogenous muscarinic acetylcholine receptor in the presence of 
carbachol (Fig. 27E). It is noteworthy that cyclamate has comparable EC50S for the 
sweet receptor (Fig. 23B) and umami receptor. Cyclamate reproducibly left-shifts the 
5 dose-response curves for L-glutamate by 

~2 fold in the presence or absence of IMP (Fig. 25F). IMP has a more dramatic 
effect of enhancing the receptor, and the effect of cyclamate is observed in the 
presence of IMP (Fig. 25F), suggesting a different mechanism from IMP in 
enhancing the receptor. IMP appears to bind to T1R1, since it has no effect on the 

10 sweet receptor. Other sweeteners, including sucrose, aspartame, saccharin, and D- 
tryptophan, had no effect on the human T1R1/T1R3 activities. 

In summary, it has been demonstrated that both T1R2 and T1R3 are required 
in a functional sweet receptor, that aspartame and neotame require the N-terminal 
extracellular domain of T1R2, G protein-coupling requires C-terminal half of T1R2, 

15 and that cyclamate and lactisole require the transmembrane domain of T1R3. These 
findings demonstrate the different functional roles of T1R subunits in a heteromeric 
complex and the presence of multiple sweetener interaction sites on the sweet 
receptor. Because T1R3 is the common subunit in the sweet and the umami receptors, 
it was predicted and confirmed the effect of cyclamate and lactisole on the umami 

20 receptor. Furthermore, a correlation was able to be made between the lactisole effect 
on the receptor activities with taste. Based on these observations, a model was created 
(Fig. 26) for the structure-function relationships of the T1R family taste receptors. 
Natural carbohydrate sweeteners bind to the N-terminal domain of T1R2, similar to 
aspartame and neotame, and there are other ligand binding sites on the sweet receptor 

25 as well, for example, the transmembrane domain of T1R2. The umami receptor 

functions similarly as a heteromeric complex, and MSG and IMP each appears to bind 
to the T1R1 subunit, since neither has any effect on the sweet receptor, and the 
transmembrane domain of T1R1 is responsible for coupling to G proteins. 
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Example 18 

HTS Protocol for Sweet Tastants 
An HEK293 cell line derivative (Chandrashekar, J., Mueller, K.L., Hoon, M.A., 
Adler, E., Feng, L., Guo, W., Zuker, C.S., Ryba, NJ.,. Cel,l 2000, 100, 703-71 1.) that 
5 stably expresses Gal5 and hTlR2/hTlR3 (Li, X., Staszewski, L., Xu, H., Durick, K., 
Zoller, M., Adler, E. Proc Natl Acad Sci USA 2002, 99, 4692-4696, World Patent # 
WO 03/001 876 A2, herein incorporated by reference in their entirety) was used to 
identify compounds with sweet taste enhancing properties. 

Compounds were initially selected based on their activity on the hTlR2/hTlR3- 

10 HEK293-Gal5 cell line (Li, et al. vide supra). Activity was determined using an 

automated fluorometric imaging assay on a FLIPR instrument (Fluorometric Intensity 
Plate Reader, Molecular Devices, Sunnyvale, CA) (designated FLIPR assay). Cells 
from one clone (designated S-9 cells) were seeded into 384-well plates (at 
approximately 50,000 cells per well) in a medium containing DMEM Low Glucose 

15 (Invitrogen, Carlsbad, CA), 10% dialyzed fetal bovine serum (Invitrogen, Carlsbad, 
CA), 100 Units/ml Penicillin G, and 100 jig/ml Streptomycin (Invitrogen, Carlsbad, 
CA) (Li, et al. vide supra) see also World Patent #WO 03/001876 A2). S-9 cells were 
grown for 24 hours at 37 °C. S-9 cells were then loaded with the calcium dye Fluo- 
3AM (Molecular Probes, Eugene, OR), 4 |iM in a phosphate buffered saline (D-PBS) 

20 (Invitrogen, Carlsbad, CA), for 1 hour at room temperature. After replacement with 25 
(xl D-PBS, stimulation was performed in the FLIPR instrument and at room temperature 
by the addition of 25 \i\ D-PBS supplemented with different stimuli at concentrations 
corresponding to twice the desired final level. Receptor activity was quantified by 
determining the maximal fluorescence increases (using a 480 nm excitation and 535 nm 

25 emission) after normalization to basal fluorescence intensity measured before 
stimulation. 

For dose-responses analysis, stimuli were presented in duplicates at 10 different 
concentrations ranging from 60 nM to 30 |uM. Activities were normalized to the 
response obtained with 400 mM D-fructose, a concentration that elicits maximum 
30 receptor response. EC50S were determined using a non-linear regression algorithm 
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(using Senomyx, Inc. software), where the Hill slope, bottom asymptotes and top 
asymptotes were allow to vary. Identical results were obtained when analyzing the 
dose-response data using commercially available software for non-linear regression 
analysis such as GraphPad PRISM (San Diego, CA). 

5 In order to determine the dependency of hTlR2/hTlR3 for the cell response to 

different stimuli, selected compounds were subjected to a similar analysis on HEK293- 
Gocl5 cells (not expressing the human sweet receptor). The HEK293-Gal5 cells do not 
.„ show any functional response in the FLIPR assay to D-Fructose or any other known 
sweeteners. Similarly, compounds described herein do not induce any functional 
10 response when using HEK293-Gal5 cells in the FLIPR assay. 

Example 19 

Flavor Enhancement Measurements for Sweet Tastants using Human Volunteers 

Basic screening of sensory taste testers: Potential panelists were tested for their 
abilities to rank and rate intensities of solutions representing the five basic tastes. 

1 5 Panelists ranked and rated intensity of five different concentrations of each of the five 
following compounds: sucrose (sweet), sodium chloride (salty), citric acid (sour), 
caffeine (bitter), and monosodium glutamate (umami). Panelists tasted a total of 25 
samples per session (5 samples of each of the 5 solution types). In the first session, 
panelists ranked the five concentrations for intensity of the attribute in question. This 

20 was repeated four more times with other samples. In the second session, panelists rated 
intensity of the five concentrations of each sample using a line scale called the 
"Labeled Magnitude Scale" (LMS). The LMS is anchored with intensities (e.g. barely 
detectable, weak, moderate, strong, very strong, and strongest imaginable) to assist 
panelists in rating the samples. Samples were tasted in 10ml volumes at room 

25 temperature and labeled with 3-digit blinding codes. Samples were presented in 
randomized, counterbalanced order within each sample solution (e.g. sucrose, citric 
acid, etc.). 

In order to be selected for participation in testing, panelists needed to coirectly 
rank and rate samples for intensity, with a reasonable number of errors. Approximately 
30 25 people successfully completed this procedure. 

Panelists selected in the above procedure were deemed qualified for performing 
Preliminary Taste Testing procedures. The preliminary taste tests are used to evaluate 
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new compounds for intensity of basic tastes and off-tastes. A small group of panelists 
(n=5) taste approximately 5 concentrations of the compound (range typically between 
1-lOOuM, in half-log cycles, e.g. 1, 3, 10, 30, and lOOuM) in water or buffer and in a 
solution of 4% (w/v, 117 mM) sucrose to evaluate enhancement. Typically samples 
5 also contain 0. 1 % ethanol in order to aid dispersion of the compound in a water-based 
solution. Panelists rate the five basic tastes (sweet, salty, sour, bitter, and umami) as 
well as off-tastes (such as chemical, metallic, sulfur) on the LMS. Samples are served 
in 10ml portions at room temperature. The purpose of the test is to determine the 
highest concentration at which there is no objectionable off-taste, and determine if 
1 0 obvious enhancement of sweet taste exists at any of the concentrations tested. 

If the compound is effective and does not have objectionable off-tastes, it is 
tested with a trained (expert panel) in a larger study. 

For example; Five panelists evaluated 1, 3, 10, 30, and lOOuM XVI-3 in water 
and in 4% sucrose solution. All samples with compound were balanced for ethanol at 
15 0.1 % (aids in dispersion of compound). Panelists were asked to rate basic tastes and 
off-tastes using the LMS for each sample tasted. When panelists noted sweetness in 
any sample, they were asked to taste reference samples of sucrose (2, 4, 6, 8% sucrose) 
to estimate equivalent sweetness. 

A trained (expert) panel was used to further evaluate compounds that had been 
20 tested with the preliminary taste test 

Panelists for the trained panel were selected from the larger group of qualifying 
taste panelists. Panelists were further trained on sweet taste by ranking and rating 
experiments using sucrose solutions. Panelists completed a series of ranking, rating, 
and difference from reference tests with sweet solutions. In ranking and rating 
25 experiments, panelists evaluated sucrose concentrations (2, 4, 6, 8 % (w/v)) sucrose. 

Compounds tested by the trained panel were evaluated in difference from 
reference experiments. Panelists were given reference samples of various 
concentrations (2,4,6, or 8 % (w/v) sucrose) and asked to rate samples on a scale of -5 
to +5 in terms of difference in sweet taste from the reference (score: -5= much less 
30 sweet taste than the reference; 0=same sweet taste as the reference; +5=much more 
sweet taste than the reference). Test samples were solutions with varying amounts of 
sucrose and compound. Typically, each session compared the reference sample 
(labeled as REF) to numerous test samples (labeled with 3-digit blinding codes). Tests 
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typically included various samples with varying concentrations of sucrose, as well as 
one blind sample of the reference itself, to evaluate panel accuracy. Compounds were 
tested against the reference in samples with and without 4% or 6% sucrose. All 
samples were presented in 10ml volumes at room temperature. Futhermore, to 
5 determine the sweetness of the compound alone, a reference solution was prepared at 
the designated concentration and compared to the threshold sweetness of sucrose (2%). 

Example 20 
HTS Protocol for Umami Tastants 
HEK-G a i5 cells were engineered to inducibly express T1R1/T1R3 using the 
10 GeneSwitch system (Invitrogen). pGene-derived zeocin-resistant expression vectors 
for human T1R1 and T1R3 (plasmid SXV603 for T1R1 and SXV61 1 for T1R3) and a 
puromycin-resistant pSwitch-derived vector that carries the GeneSwitch protein 
(plasmid SXV628) were linearized and cotransfected into the HEK-G a t5 cell line. 
Zeocin-resistant and puromycin-resistant colonies were selected, expanded, induced 
15 with variable amounts of mifepristone, and tested by calcium imaging for responses to 
umami taste stimuli. Cells were selected in 0.5 fig/ml puromycin (CAL BIOCHEM) 
and 100 ng/ml Zeocin (Invitrogen) at 37°C in Dulbecco's Modified Eagle Medium 
supplemented with GlutaMAX, (10 % dialyzed FBS, and 3 ug/ml blasticidin. Resistant 
colonies were expanded, and their responses to umami taste stimuli following induction 
20 with 1 0' 10 M mifepristone determined by fluorescence microscopy following the 
methods of Li, et al, PNAS (2002) 99(7):4692-4696. For automated fluorometric 
imaging on FLIPR instrumentation (Molecular Device), cells from one clone 
(designated clone 1-17) were seeded into 96- or 384-well plates (approximately 80,000 
cell per well) in the presence of 10 -t0 M mifepristone and incubated for 48 hours. Cells 
25 were then loaded with the calcium dye fluo-4-AM (Molecular Probes), 3 pM in PBS, 
for 1.5 hours at room temperature. After replacement with 50 jil PBS, stimulation was 
performed at room temperature by the addition of 50 pi PBS supplemented with 
different stimuli. The maximal fluorescence from four independent determinations 
were averaged, corrected for background fluorescence measured prior to compound 
30 addition, and normalized to the response to 0.002 mM ionomycin (CALBIOCHEM). 

Example 21 
Taste Test Protocol for Umami Tastants 
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Basic Training of Sensory Tasters: Tasters were trained to evaluate the taste of 
aqueous solutions (5 mL each, "swash and spit") of the following standard taste 
compounds by using the triangle test as described in the literature: sucrose (50 mM) for 
sweet taste; citric acid (5 mM) or lactic acid (20 mM) for sour taste; NaCl (12 mM) for 
5 salty taste, quinine (10 fiM) or caffeine (1 mM) for bitter taste; and monosodium 
glutamate (8 mM) for umami or "savory" taste. 

Training for Umami Taste: Tasters were given 1-3 sets of 6 MSG and/or MSG- 
IMP samples ranging from 3-60 mM MSG and 0-200 nM IMP, each arranged in the 
tray in ascending concentration. This exercise gave the subject practice doing dose 
1 0 response evaluations. Then another set was made up of the same six samples, but were 
given in random order. The subject was then asked to arrange the samples in ascending 
intensity and then to rate their umami intensity. 

Qualifying Taste Panelists: Tasters were subjected to a standard two alternative 
forced choice (2AFC) test with 5 pairs of taste samples. They were asked to make a 
15 choice of the most umami sample from two samples (a pair). The test contains two 
easy pairs, two with medium difficulty, and one difficult pair. Tasters who could 
differentiate the medium difficulty pairs were selected as panelists. 

Pilot/Qualitative taste test of Umami Enhancer Candidate (UEC) by a small 
group of panelists: Taste samples of appropriate concentrations (usually 1-50 jxM) 
20 were made in water (use minimum amount of ethanol if not soluble); Taste UEC alone 
at 30 and/or 50 nM for umami and other attributes. Rate those taste attributes on the 
appropriate Labeled Magnitude Scale (LMS) on the screening ballot; if UEC has 
no/low umami and other tastes, then move forward to discrimination test; compare 
certain concentration of MSG, e.g., 12 mM and 12 mM MSG + 30 nM UEC to 
25 determine if there is any enhancement; rate the perceived umami intensity on the 
appropriate LMS on the screening ballot; vary concentration of UEC and/or MSG to 
find the best combination; decide what solutions to use in panel screening; record all 
procedures and data including description of study, sample prep, sample arrangement, 
ballots and sign up sheet for panelists, data entry and evaluation. 
30 2AFC Panel Screening of UEC: Run panel screening with qualified panelists 

using protocols generated from the pilot tasting; record all procedures and data; prepare 
summary report with statistically significant conclusions, if any. 
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Example 22 

Quantitative Taste Tests for Compounds 2725761 and 3756807 

Quantitative taste tests for compounds 2725761 and 3756807 were run 
according to procedures presented above. It was found that both of them have some 
5 enhancement for MSG, in addition to their additive effect of the umami intensity. 

Example 23 

Synthesis of Compounds 2725761 and 3756807 

Compounds 2725761 and 3756807 are prepared as shown in Example 22, from 
10 their corresponding acids and amines. The products are purified by conventional 

methods, e.g., basic and acidic aqueous washes, or preparative HPLC. The structures 
of those compounds were confirmed based on usual analytical methods, e.g. , NMR 
and LCMS. This method can also be used to synthesize any of the compounds found in 
Tables 1-5. 

15 

Example 24 
Cell Based Assays 
Cells were grown and maintained at 37°C in Dulbecco's Modified Eagle 
Medium (DMEM) supplemented with 10% FBS and MEM non-essential amino acids 

20 (Gibco BRL); media for G a is cells also contained 3 jxg ml" 1 blasticidin (Gibco BRL). 
For calcium-imaging experiments, cells were first seeded onto 48-well tissue-culture 
plates (approximately 30,000 cells per well), and transfected using Minis TransIt-293 
(PanVera). Transfection efficiencies, which were estimated by cotransfection with an 
RFP expression vector, were typically approximately 60%. To minim ize glutamate- 

25 induced and glucose-induced desensitization, supplemented DMEM was replaced with 
low-glucose DMEM supplemented with GlutaMAX and 10% dialyzed FBS (Gibco 
BRL) approximately 24 hours after transfection. After an additional 24 hours, cells 
were loaded with the calcium dye fluo-4-AM (Molecular Probes), 3 \M in Dulbecco's 
PBS buffer (DPBS, GibcoBRL), for 1 .5 hours at room temperature. After replacement 

30 with 100 pi DPBS, stimulation was performed at room temperature by addition of 100 
pi DPBS supplemented with taste stimuli. Calcium mobilization was monitored on an 
Axiovert S100 microscope equipped with an inverted 10X/0.5 LWD piano fluor 
objective (Zeiss) and a cooled CCD camera (Princeton Instruments). Fluorescence 
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images were acquired at 480 nm excitation and 535 nm emission, and analyzed with 
Imaging Workbench 4.0 software (Axon Instruments). T1R receptor activity was 
quantitated by counting the number of responding cells 30 seconds after stimulus 
addition. 
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What is claimed: 

1 . A non-naturally occurring compound that specifically binds to a T1R2/T1R3 
receptor composed of hTlR2/hTlR3 but not rTlR2/rTlR3. 

2. A non-naturally occurring compound that specifically binds to a T1R2/T1R3 
receptor composed of hTlR2/rTlR3 but not iTlR2/hTlR3. 

3. A non-naturally occurring compound that specifically binds to the N-terminal 
extracellular domain of T1R2 of the hTlR2/hTlR3 receptor. 

4. A non-naturally occurring compound that specifically binds to a T1R2/T1R3 
receptor composed of rTlR2/hTlR3 but not hTlR2/rTlR3. 

5. A non-naturally occurring compound that specifically binds to hTlR2/hTlR3 
and rTlR2/r3-h3 but not to rTlR2/mR3 or to hTlR2/h3-r3. 

6. A non-naturally occurring compound that specifically binds to hTlR2/hTlR3 
and r2-h2/rTlR3 but not to rTlR2/rTlR3 or to h2-r2/hTlR3. 

7. A non-naturally occurring compound that specifically binds to the Venus 
Flytrap Domain (VFD) of T1R2 of the hTlR2/hTlR3 and hTlR2/rTlR3 
receptor. 

8. A non-naturally occurring compound that specifically binds to amino acid 
residues 144 and 302 of the human N-terminal Venus flytrap domain of the 
T1R2 subunit of theTlR2/TlR3 receptor. 

9. A non-naturally occurring compound that specifically binds to the N-terminal 
Venus flytrap domain of the T1R2 subunit of the T1R2/T1R3 receptor, wherein 
the compound is about 12x5x5 angstroms. 
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10. A non-naturally occurring compound that specifically binds to the cysteine-rich 
region of T1R2 of the hTlR2/hTlR3 receptor. 

11. A non-naturally occurring compound that specifically binds to the 
Transmembrane Domain (TM) of T1R2 of the hTlR2/hTlR3 receptor. 

. 12. A non-naturally occurring compound that specifically binds to the human N- 
terminal extracellular domain of the T1R3 subunit of the T1R2/T1R3. 

13. A non-naturally occurring compound that specifically binds to the Venus 
Flytrap Domain (VFD) of T1R3 of the hTlR2/hTlR3 receptor. 

14. A non-naturally occurring compound that specifically binds to the 
Transmembrane Domain (TM) of T1R3 of the hTlR2/hTlR3 receptor. 

15. A non-naturally occurring compound that specifically binds to extracellular 
loop 2 and extracellular loop 3 of the human transmembrane domain of the 
T1R3 subunit of T1R2/T1R3. 

16. The compound of any one of claims 1-15 that demonstrates compound- 
dependent increase in fluorescence with an activity compared to the maximal 
activity for fructose of at least 25% in a fluorescence-based assay using a 
FLIPR (Molecular Devices) instrument. 

17. The compound of any of claims 1-16 that demonstrates a compound-dependent 
decrease in the EC50 for a sweetener by at least two-fold in a fluorescence-based 
assay using a FLIPR (Molecular Devices) instrument. 

18. The compound of any one of claims 1-17 that results in at least 10 out of 100 
cells transfected with wild-type or chimeric receptor showing a compound- 
dependent increase in fluorescence. 
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19. The compound of any one of claims 1-18 that demonstrates a compound- 
dependent increase of at least two-fold of the number of fluorescent cells in 
response to a sub-maximal level of a sweetener. 

20. The compound of any one of claims 1-16 that demonstrates a compound- 
dependent increase in the response of cells to a sub-maximal level of a 
sweetener of at least 1 .25-fold compared to the response to the sweetener alone. 

21. The compound of claim 20, wherein the response is measured by fluorescence, 
calcium levels, IP3 levels, cAMP levels, GTPyS binding, or reporter gene 
activity (e.g. luciferase, beta-galactosidase). 

22. The compound of any one of claims 1-21, having one or more of the following 
characteristics in a cell: 

decreased EC50 compared to a control of at 50%, 
increased intracellular Ca2+ levels by at least approximately 25%, 
increased intracellular cAMP by at least approximately 25%, 
increased intracellular cGMP by at least approximately 25%, 
increased intracellular IP3 by at least approximately 25%, or 
increased G protein binding of GTPyS by at least approximately 25%. 

23. A chimeric T1R2/T1R3 receptor comprising, a human T1R2 subunit and a rat 
T1R3 subunit. 

24. A chimeric T1R2/T1R3 receptor comprising, a rat T1R2 subunit and a human 
T1R3 subunit. 

25. A chimeric T1R2 receptor subunit comprising, a human extracellular domain, a 
rat transmembrane domain and a rat intracellular domain. 

26. A chimeric T1R3 receptor subunit comprising, a rat extracellular domain, a 
human transmembrane domain and a human intracellular domain. 
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27. A non-naturally occurring compound that binds to the N-terminal extracellular 
domain of T1R1 of the TlRl/hTlR3 receptor. 

28. A non-naturally occurring compound that binds to the T1R1VFD of the 
T1R1/T1R3 savory receptor. 

29. A non-naturally occurring compound that binds to the cysteine-rich region of 
T1R1 of the TlRl/hTlR3 receptor. 

30. A non-naturally occurring compound that binds to the T1R1 TM domain of the 
T1R1/T1R3 savory receptor. 

3 1 . A non-naturally occurring compound that binds to the N-tenninal extracellular . 
domain of T1R3 of the TlRl/hTlR3. receptor. 

32. A non-naturally occurring compound that binds to the T1R3 VFD of the 
T1R1/T1R3 savory receptor. 

33. A non-naturally occurring compound that binds to the cysteine-rich region of 
T1R3 of the TlRl/hTlR3 receptor. 

34. A non-naturally occurring compound that binds to the T1R3 TM domain of the 
T1R1/T1R3 savory receptor. 

35. A non-naturally occurring compound that binds to the TM domain of T1R1 of a 
truncated savory receptor composed of the hlTM/h3TM. 

36. A non-naturally occurring compound that binds to the TM domain of T1R3 of a 
truncated sweet receptor composed of hi TM/h3TM. 

37. A non-naturally occurring compound that binds to the TM domain of T1R1 of a 
chimeric receptor composed of mGluR-hl/mGluR-h3. 
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38. A non-naturally occurring compound that binds to the TM domain of T1R3 of a 
chimeric receptor composed of mGluR-hl/mGluR-h3, 

39. The compound of any one of claims 27-38 that demonstrates compound- 
dependent increase in fluorescence with an activity compared to the maximal 
activity of glutamate of at least 25% in a fluorescence-based assay using a 
FLIPR (Molecular Devices) instrument. 

40. The compound of any one of claims 27-38 that demonstrates a compound- 
dependent decrease in the EC50 for glutamate by at least two-fold in a' 
fluorescence-based assay using a FLIPR (Molecular Devices) instrument. 

41 . The compound of any one of claims 27-38 that results in at least 10 out of 100 
transfected cells showing a compound-dependent increase in fluorescence 
measured with a fluorescent microscope. 

42. The compound of any one of claims 27-38 that results in a compound- 
dependent increase of at least two-fold of the number, of fluorescent cells in 
response to a sub-maximal level of glutamate. 

43. The compound of any one of claims 27-38 that results in a compound- 
dependent increase in the response of cells to a sub-maximal level of glutamate 
of at least 1.25-fold compared to the response to glutamate alone. 

44. The compound of claim 43, wherein the response is measured by fluorescence, 
calcium levels, IP3 levels, cAMP levels, rrnyS binding, or reporter gene 
activity. 

45. A method for identifying compounds that modulate taste perception by 
identifying compounds that bind to, activate, inhibit, enhance and/or modulate 
one or more of the receptors of any one of claims 23-26. 
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46. A method for identifying a compound that modulates sweet taste perception 
comprising comparing the effect of the compound on a sweet receptor to the 
effect of a compound of any one of claims 1-22, an enhancement of sweet 
perception approximately equal to or greater than the sweet enhancement of the 
compound indicating a compound that enhances sweet perception, 

47. A method for identifying a compound that modulates umami taste perception 
comprising comparing the effect of the compound on an umami receptor to the 
effect of a compound of any one of claims 27-44, an enhancement of savory 
perception approximately equal to or greater than the savory enhancement of 
the compound indicating a compound that enhances umami perception, 

48. A method for identifying compounds that modulate taste perception by 
identifying compounds that bind to, activate, inhibit, and/or modulate a receptor 
expressed by a cell that stably expresses one or more of the receptors of any one 
of claims 23-26. 

49. A method for modulating the savory taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a savory flavor modulating amount of at least one non-naturally 
occurring compound of any one of claims 27-44, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby modulating the savory taste of a comestible or medicinal product. 

50. A method for inhibiting the savory taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 
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combining the comestible or medicinal product or precursor thereof with at 
least a savory flavor inhibiting amount of at least one non-naturally 
occurring compound of any one of claims 27-44, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby inhibiting the savory taste of a comestible or medicinal product. 

51. A method for increasing the savory taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a savory flavor increasing amount of at least one non-naturally 
occurring compound of any one of claims 27-44, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby increasing the savory taste of a comestible or medicinal product. 

52. A method for modulating the sweet taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a sweet flavor modulating amount of at least one non-naturally 
occurring compound of any one of claims 1-22 and 60-61, or a comestibly 
acceptable salt thereof so as to form a modified comestible or medicinal 
product; 

thereby modulating the sweet taste of a comestible or medicinal product. 

53. A method for inhibiting the sweet taste of a comestible or medicinal product 
comprising: 
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providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a sweet flavor inhibiting amount of at least one non-naturally 
occurring compound of any one of claims 1-22 and 60-61, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby inhibiting the sweet taste of a comestible or medicinal product. 

54. A method for increasing the sweet taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a sweet flavor increasing amount of at least one non-naturally 
occurring compound of any one of claims 1-22 or 60-61, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby increasing the sweet taste of a comestible or medicinal product. 

55. A method of enhancing umami taste perception comprising contacting an 
umami receptor with cyclamate and NHDC, and their derivatives. 

56. A method of enhancing umami taste perception comprising contacting an 
umami receptor with lactisole derivatives. 

57. A method of enhancing sweet taste perception comprising contacting a sweet 
receptor with cyclamate and NHDC, and their derivatives. 

58. A method of enhancing sweet taste perception comprising contacting a sweet 
receptor with lactisole derivatives. 

59. The compound of any one of claims 27-44, wherein the compound is not 
sucrose, fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, 
certain known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, 
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tri-peptides aspartame, saccharin, sucralose, halogenated saccharides, 
acesulfame-K, cyclamate, sucralose, alitame, monosodium glutamate ("MSG"), 
inosine monophosphate (IMP), adenosine monophosphate, or guanosine 
monophosphate (GMP). 

60. The compound of any one of claims 1-22, wherein the compound is not sucrose, 
fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain 
known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, tri- 
peptides, aspartame, saccharin, sucralose, halogenated saccharides, acesulfame- 
K, cyclamate, sucralose, and alitame. neotame, perillartine, SC-45647, SC- 
40014, monellin, NC-002740-01, thaumatin, CC-00100, NC-00420, alitame, 
SC-44102, dulcin, NC-00576, slycyrrhizic Acid, stevioside, Na-Saccharin, D- 
tryptophan, cyclamate, DHB, glycohc Acid, glycine, D (-)fiructose, 
homofuronol, D (-) tagatose, maltose, D (+) glucose, D-sorbitol, D (+) 
galactose, a-lactose, LOfructose, L (+), compound 403249, or glucose. 

61 . The compound of any one of claims 1-22, wherein the compound is not sucrose, 
fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain 
known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, tri- 
peptides, aspartame, saccharin, sucralose, halogenated saccharides, acesulfame- 
K, cyclamate, sucralose, and alitame. neotame, perillartine, SC-45647, SC- 
40014, monellin, NC-002740-01 , thaumatin, CC-001 00, NC-00420, alitame, 
SC-44102, dulcin, NC-00576, slycyrrhizic Acid, stevioside, Na-Saccharin, D- 
tryptophan, cyclamate, DHB, glycolic Acid, glycine, D (-)fructose, 
homofuronol, D (-) tagatose, maltose, D (+) glucose, D-sorbitol, D (+) 
galactose, a-lactose, LOfructose, L (+), compound 403249, glucose, or 
Compound 6364395. 

62. The compound of any one of claims 27-44, wherein the compound is not 
sucrose, fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, 
certain known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, 
tri-peptides aspartame, saccharin, sucralose, halogenated saccharides, 
acesulfame-K, cyclamate, sucralose, alitame, monosodium glutamate ("MSG"), 
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inosine monophosphate (IMP), adenosine monophosphate, or Compound 
6364395, guanosine monophosphate (GMP). 
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Figure 1 : Catalog of human and rat n ks 
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Figure 2 hT1R2 and hT1R3 are expressed in human tongue 
epithelium. cDNA-specific amplification products can be 
amplified from cDNA prepared from resected human 
clrcumvallate papillae. 
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Figure 3 Human T1R2/T1R3 functions as a sweet taste 
receptor 
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Figure 4 T1 R2 may control T1 R2/T1 R3 ligand specificity 

Sucrose Aspartame 
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Figure 5 
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Figure 7 Key ligand-binding residues of mGlurRI are 
conserved in T1R1 
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Figure 8 Human T1R1/T1R3 functions as an umami taste 
receptor 
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B. 



HMT-Myc H2-PD2P-n-Myc 




Figure 9 PD2IP facilitate the surface expression of human 
T1R2. 

A. Immunofluorescence staining of Myc-tagged hT1 R2 
indicates that PDZIP significantly increases the 
amount of human T1 R2 protein on the plasma 
membrane. 

B. FACS analysis data demonstrating the same result. 

Myc-tagged human T1R2: Green line. Myc- 
tagged 

C. human T1R2 with PDZIP: black line. 
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Figure 10 Calcium-imaging data demonstrating hT1R2/hT1R3 
responses to a number of sweet stimuli. 
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Figure 11 



hT1R1 ♦ hT1R3, constant IMP (0.5mM) 
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Figure 12 
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Figure 13 
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Figure 14 
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Figure 1 6 Lactisole Inhibits the T1R2/T1R3 sweet and T1R1/T1R3 umaml receptors and 
sweet and umaml taste. {Left panel) responses of HEK-G a18 cells transiently transfected with 
T1R1/T1R3 (circles) to 10 mM L-glutamate and HEK-G a16 cells transiently transfected with 
T1R2/T1R3 (squares) to 150 mM sucrbse In the presence of variable concentrations of lactisole 
are shown. (Right panel) fold increases In taste detection thresholds in the presence of 1 and 2 mM 
lactisole are shown for the sweet taste stimuli sucrose and D-tryptophan, the umaml taste stimuli L- 
glutamate (MSG) and L-glutamate plus 02 mM IMP, and sodium chloride. Detection thresholds 
were determined following the method of Schfffman et al. 
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Figure 18 
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Figure 19 
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Figure 20 
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Figure 21A 
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Figure 21B 
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Fig. 24 
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SEQUENCE LISTING 

<110> Senomyx, Inc. 

<120> T1R HETERO-OLIGOMERIC TASTE RECEPTORS , 

CELL LINES THAT EXPRESS SAID RECEPTORS , AND TASTE COMPOUNDS 



<130> 19328. OOOlul 

<140> Uhassigned 
<141> 2004-08-06 

<150> 60/494,071 
<151> 2003-08-06 

<150> 60/552,064 
<151> 2004-03-09 

<160> 33 

<170> FaetSEQ for Windows Version 4.0 

<210> 1 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note ■ 
synthetic construct 

<400> 1 

Ser Val Ser Thr Trp 
1 5 

<210> 2 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<221> VARIANT 

<222> 1,3,4,6,7,8,11,12,13 

<223> At position 1, Xaa can be either Thr or Arg 
At position 3, Xaa can be either Phe or Leu 
At position 4, Xaa can be either Arg, Gin, or Pro 
At position 6, Xaa can be either Arg or Thr 
At position 7, Xaa can be either Ser, Pro or Val 
At position 8, Xaa can be either Val, Glu, Arg, 
Lys or Thr 

At position 11, Xaa can be either Ala or Glu 
At position 12, Xaa can be either Trp or Leu 
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At position 13, Xaa can be either Arg, His or Gly 
<400> 2 

Xaa Cys Xaa Xaa Arg Xaa Xaa Xaa Phe Leu Xaa Xaa Xaa Glu 
1 5 10 

<210> 3 
<211> 15 
<212> PRT 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence/ note » 
synthetic construct 

<221> VARIANT 

<222> 1,3,4,7,9,10,11,13,14,15 

<223> At position 1, Xaa can be either Leu or Gin 

At position 3, Xaa can be either Glu, Gly or Thr 
At position 4, Xaa can be either Asn, Arg or Cys 
At position 7, Xaa can be either Arg or Glu 
At position 9, Xaa can be either Arg or Lys 
At position 10, Xaa can be either Cys, Gly or Phe 
At position 11, Xaa can be either Val, Leu or lie 
At position 13,. Xaa can be either Phe or Leu 
At position 14, Xaa can be either Ala or Ser 
At position 15, Xaa can be either Met or Leu 

<400> 3 

Xaa Pro Xaa Xaa Tyr Asn Xaa Ala Xaa Xaa Xaa Thr Xaa Xaa Xaa 
15 10 15 

<210> 4 
<211> 858 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 4 



Met 


Pro 


Gly 


Leu 


Ala 


He 


Leu 


Gly 


Leu 


Ser 


Leu 


Ala Ala Phe Leu Glu 


1 








5 










10 




15 


Leu 


Gly 


Met 


Gly 


Ser 


Ser 


Leii 


Cys 


Leu 


Ser 


Gin 


Gin Phe Lys Ala Gin 








20 










25 






30 


Gly 


Asp 


Tyr 


lie 


Leu 


Gly 


Gly 


Leu 


Phe 


Pro 


Leu 


Gly Thr Thr Glu Glu 






35 










40 








45 


Ala 


Thr 


Leu 


Asn 


Gin 


Arg 


Thr 


Gin 


Pro 


Asn 


Gly 


He Leu Cys Thr Arg 




50 










55 










60 


Phe 


Ser 


Pro 


Leu 


Gly 


Leu 


Phe 


Leu 


Ala 


Met 


Ala 


Met Lys Met Ala Val 


65 










70 










75 


80 


Glu 


Glu 


lie 


Asn 


Asn 


Gly 


Ser 


Ala 


Leu 


Leu 


Pro 


Gly Leu Arg Leu Gly 










85 










90 




95 


Tyr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cys 


Ser 


Glu 


Pro 


Val 


Val Thr Met Lys Pro 








100 










105 






110 


Ser 


Leu 


Met 


Phe 


Met 


Ala 


Lys 


Val 


Gly 


Ser 


Gin 


Ser He Ala Ala Tyr 
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115 



Cys 


Asn 


Tyr 


Thr 


Gin 


Tyr 




130 










His 


Ser 


Ser 


Glu 


Leu 


Ala 


145 










150 


Leu 


Met 


Pro 


Gin 


Val 


Ser 










165 




Arg 


Glu 


Thr 


Phe 


Pro 


Ser 








180 






Gin 


Leu 


Gin 


Ala 


Val 


Val 






195 








Val 


Ala 


Ala 


Leu 


Gly 


Ser 




210 










lie 


Phe 


Ser 


Gly 


Leu 


Ala 


225 










230 


Gly 


Leu 


Val 


Pro 


Gin 


His 










245 




Val 


Asp 


Val 


Leu 


Arg 


Gin 








260 






Leu 


Phe 


Ala 


Ser 


Ala 


Arg 






275 








Leu 


His 


Asp 


Leu 


Ser 


Pro 




290 










Thr 


Ser 


Asp 


Leu 


Val 


Met 


305 










310 


Val 


Leu 


Gly 


Phe 


Leu 


Gin 










325 




Tyr 


Val 


Glu 


Thr 


Arg 


Leu 








340 






Ser 


Leu 


Lys 


Ala 


Glu 


Leu 






355 








Cys 


Ser 


Gin 


Cys 


Asp 


Tyr 




370 










Met 


Gin 


Asn 


Leu 


Ser 


Ala 


385 










390 


Tyr 


Ala 


Ala 


Val 


Tyr 


Ser 










405 




Cys 


Asn 


Val 


Ser 


His 


Cys 








420 






Leu 


Leu 


Glu 


Asn 


Met 


Tyr 






435 








Leu 


Gin 


Phe 


Asp 


Ala 


Lys 




450 










Met 


Trp 


Val 


Trp 


Gin 


Ser 


465 










470 


Phe 


Asn 


Gly 


Thr 


Leu 


Gin 










485 




Asn 


Gin 


Val 


Pro 


Val 


Ser 








500 






Val 


Arg 


Arg 


Val 


Lys 


Gly 






515 








Cys 


Lys 


Ala 


Gly 


Ser 


Tyr 




530 










Pro 


Cys 


Gly 


Lys 


Asp 


Gin 


545 










550 


Pro 


Arg 


Arg 


Pro 


Lys 


Phe 



565 



120 



Gin 


Pro 


Arg 


Val 


Leu 


Ala 


135 










140 


Leu 


He 


Thr 


Gly 


Lys 


Phe 










155 




Tyr 


Ser 


Ala 


Ser 


Met 


Asp 








170 






Phe 


Phe 


Arg 


Thr 


Val 


Pro 






185 








Thr 


Leu 


Leu 


Gin 


Asn 


Phe 




200 










Asp 


Asp 


Asp 


Tyr 


Gly 


Arg 


215 










220 


Asn 


Ser 


Arg 


Gly 


He 


Cys 










235 




Asp 


Thr 


Ser 


Gly 


Gin 


Gin 








250 






Val 


Asn 


Gin 


Ser 


Lys 


Val 






265 








Ala 


Val 


Tyr 


Ser 


Leu 


Phe 




280 










Lys 


Val 


Trp 


Val 


Ala 


Ser 


295 










300 


Thr 


Leu- 


Pro 


Asn 


He 


Ala 










315 




Arg 


Gly 


Ala 


Leu 


Leu 


Pro 








330 






Ala 


Leu 


Ala 


Ala 


Asp 


Pro 






345 








Asp 


Leu 


Glu 


Glu 


Arg 


Val 




360 










He 


Met 


Leu 


Gin 


Asn 


Leu 


375 










360 


Gly 


Gin 


Leu 


His 


His 


Gin 










395 




Val 


Ala 


Gin 


Ala 


Leu 


His 








410 






His 


Thr 


Ser 


Glu 


Pro 


Val 






425 








Asn 


Met 


Ser 


Phe 


Arg 


Ala 




440 










Gly 


Ser 


Val 


Asp 


Met 


Glu 


455 










460 


Pro 


Thr 


Pro 


Val 


Leu 


His 










475 




Leu 


Gin 


His 


Ser 


Lys 


Met 








490 






Gin 


Cys 


Ser 


Arg 


Gin 


Cys 






505 








Phe 


His 


Ser 


Cys 


Cys 


Tyr 




520 










Arg 


Lys 


His 


Pro 


Asp 


Asp 


535 










540 


Trp 


Ser 


Pro 


Glu 


Lys 


Ser 










555 




Leu 


Ala 


Trp 


Gly 


Glu 


Pro 



570 
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125 




Val He Gly 


Pro 


Phe Ser Phe 


Phe 




16.0 


Arg Leu Ser 


Asp 


175 




Ser Asp Arg 


Val 


190 




Ser Trp Asn 


Trp 


205 




Glu Gly Leu 


Ser 


He. Ala His 


Glu 




240 


Leu Gly Lys 


Val 


255 




Gin Val Val 


Val 


270 




Ser Tyr Ser 


He 


285 




Glu Ser Trp 


Leu 


Arg Val Gly 


Thr 




320 


Glu Phe Ser 


His 


335 




Thr Phe Cys 


Ala 


350 




Met Gly Pro 


Arg 


365 




Ser Ser Gly 


Leu 


He Phe Ala 


Thr 




400 


Asn Thr Leu 


Gin 


415 




Gin Pro Trp 


Gin 


430 




Arg Asp Leu 


Thr 


445 




Tyr Asp Leu 


Lys 


Thr Val Gly 


Thr 




480 


Tyr Trp Pro 


Gly 


495 




Lys Asp Gly 


Gin 


510 




Asp Cys Val 


Asp 


525 




Phe Thr Cys 


Thr 


Thr Thr Cys 


Leu 




560 


Ala Val Leu 


Ser 


575 
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Leu 


Leu 


Leu 


Leu Leu Cys Leu 


Val Leu 


Gly 


Leu 


Thr 


Leu 


Ala 


Ala 


Leu 








580 


585 










590 






Gly 


Leu 


Phe 


Val His Tyr Trp 


Asp Ser 


Pro 


Leu 


Val 


Gin 


Ala 


Ser 


Gly 






595 




600 








605 








Gly 


Ser 


Leu 


Phe Cys Phe Gly 


Leu lie 


Cys 


Leu 


Gly 


Leu 


Phe 


Cys 


Leu 




610 




615 








620 










Ser 


Val 


Leu 


Leu Phe Pro Gly 


Arg Pro 


Arg 


Ser 


Ala 


Ser 


Cys 


Leu 


Ala 


625 






630 






635 










640 


Gin 


Gin 


Pro 


Met Ala His Leu 


Pro Leu 


Thr 


Gly 


Cys 


Leu 


Ser 


Thr 


Leu 








645 




650 










655 




Phe 


Leu 


Gin 


Ala Ala Glu lie 


Phe Val 


Glu 


Ser 


Glu 


Leu 


Pro 


Leu 


Ser 








660 


665 










670 






Trp 


Ala 


Asn 


Trp Leu Cys Ser 


Tyr Leu 


Arg 


Gly 


Pro 


Trp 


Ala 


Trp 


Leu 






675 




680 








685 








Val 


Val 


Leu 


Leu Ala Thr Leu 


Val Glu 


Ala 


Ala 


Leu 


Cys 


Ala 


Trp 


Tyr 




690 




695 








700 










Leu 


Met 


Ala 


Phe Pro Pro Glu 


Val Val 


Thr 


Asp 


Trp 


Gin 


Val 


Leu 


Pro 


705 






710 






715 










720 


Thr 


Glu 


Val 


Leu Glu His Cys 


Arg Met 


Arg 


Ser 


Trp 


Val 


Ser 


Leu 


Gly 








725 




730 










735 




Leu 


Val 


His 


lie Thr Asn Ala 


Val Leu 


Ala 


Phe 


Leu 


Cys 


Phe 


Leu 


Gly 








740 


745 










750 






Thr 


Phe 


Leu 


Val Gin Ser Gin 


Pro Gly 


Arg 


Tyr 


Asn 


Arg 


Ala 


Arg 


Gly 






755 




760 








765 








Leu 


Thr 


Phe 


Ala Met Leu Ala 


Tyr Phe 


lie 


He 


Trp 


Val 


Ser 


Phe 


Val 




770 




775 








780 










Pro 


Leu 


Leu 


Ala Asn Val Gin 


Val Ala 


Tyr 


Gin 


Pro 


Ala 


Val 


Gin 


Met 


785 






790 






795 










800 


Gly 


Ala 


lie 


Leu Phe Cys Ala 


Leu Gly 


He 


Leu 


Ala 


Thr 


Phe 


His 


Leu 








605 




810 










815 




Pro 


Lys 


Cys 


Tyr Val Leu Leu 


Trp Leu 


Pro 


Glu 


Leu 


Asn 


Thr 


Gin 


Glu 








820 


825 










830 






Phe 


Phe 


Leu 


Gly Arg Ser Pro 


Lys Glu 


Ala 


Ser 


Asp 


Gly 


Asn 


Ser 


Gly 






835 




840 








845 








Ser 


Ser 


Glu 


Ala Thr Arg Gly 


His Ser 


Glu 















850 855 



<210> 5 
<211> 841 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note <= 
synthetic construct 

<400> 5 



Met Leu 


Leu Cys 


Thr Ala Arg 


Leu Val 


Gly Leu 


Gin 


Leu 


Leu 


lie 


Ser 


1 




5 




10 








15 




Cys Cys 


Trp Ala 


Phe Ala Cys 


His Ser 


Thr Glu 


Ser 


Ser 


Pro 


Asp 


Phe 




20 




25 








30 






Thr Leu 


Pro Gly 


Asp Tyr Leu 


Leu Ala 


Gly Leu 


Phe 


Pro 


Leu 


His 


Ser 




35 




40 






45 








Gly Cys 


Leu Gin 


Val Arg His 


Arg Pro 


Glu Val 


Thr 


Leu 


Cys 


Asp 


Arg 


50 




55 




60 










Ser Cys 


Ser Phe 


Asn Glu His 


Gly Tyr 


His Leu 


Phe 


Gin 


Ala 


Met 


Arg 


65 




70 




75 










80 



4 
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Leu Gly 


Val 


Glu 


Glu 


He 


Asn 


Asn Ser 


Thr Ala Leu Leu Pro Asn He 










85 








90 95 


Thr 


Leu 


Gly 


Tyr 


Gin 


Leu 


Tyr 


Asp Val 


Cys Ser Asp Ser Ala Asn Val 








100 








105 


110 


Tyr Ala 


Thr 


Leu 


Arg 


Val 


Leu 


Ser Leu 


Pro Gly Gin His His He Glu 






115 










120 


125 


Leu 


Gin 


Gly 


Asp 


Leu 


Leu 


His 


Tyr Ser 


Pro Thr Val Leu Ala Val He 




130 










135 




140 


Gly Pro 


Asp 


Ser 


Thr 


Asn 


Arg 


Ala Ala 


Thr Thr Ala Ala Leu Leu Ser 


145 










150 






155 160 


Pro 


Phe 


Leu 


Val 


Pro 


Met 


He 


Ser Tyr 


Ala Ala Ser Ser Glu Thr Leu 










165 








170 175 


Ser 


Val 


Lys 


Arg 


Gin 


Tyr 


Pro 


Ser Phe 


Leu Arg Thr He Pro Asn Asp 








180 








185 


190 


Lys 


Tyr 


Gin 


Val 


Glu 


Thr 


Met 


Val Leu 


Leu Leu Gin Lys Phe Gly Trp 






195 










200 


205 


Thr Trp 


lie 


Ser 


Leu 


Val 


Gly 


Ser Ser 


Asp Asp Tyr Gly Gin Leu Gly 




210 










215 




220 


Val 


Gin 


Ala 


Leu 


Glu 


Asn 


Gin 


Ala Thr 


Gly Gin Gly He Cys lie Ala 


225 










230 






235 240 


Phe 


Lys 


Asp 


He 


Met 


Pro 


Phe 


Ser Ala 


Gin Val Gly Asp Glu Arg Met 










245 








250 255 


Gin Cys 


Leu 


Met 


Arg 


His 


Leu 


Ala Gin 


Ala Gly Ala Thr Val Val Val 








260 








265 


270 


Val Phe 


Ser 


Ser 


Arg 


Gin 


Leu 


Ala Arg 


Val Phe Phe Glu Ser Val Val 






275 










280 


285 


Leu 


Thr 


Asn 


Leu 


Thr 


Gly 


Lys 


Val Trp 


Val Ala Ser Glu Ala Trp Ala 




290 










295 




300 


Leu 


Ser 


Arg 


His 


He 


Thr 


Gly 


Val Pro 


Gly He Gin Arg He Gly Met 


305 










310 






315 320 


Val 


Leu 


Gly 


Val 


Ala 


He 


Gin 


Lys Arg 


Ala Val Pro Gly Leu Lys Ala 










325 








330 335 


Phe 


Glu 


Glu 


Ala 


Tyr 


Ala 


Arg 


Ala Asp 


Lys Lys Ala Pro Arg Pro Cys 








340 








. 345 


350 


His 


Lys 


Gly 


Ser 


Trp 


Cys 


Ser 


Ser Asn 


Gin Leu Cys Arg Glu Cys Gin 






355 










360 


365 


Ala 


Phe 


Met 


Ala 


His 


Thr 


Met 


Pro Lys 


Leu Lys Ala Phe Ser Met Ser 




370 










375 




380 


Ser 


Ala 


Tyr 


Asn 


Ala 


Tyr 


Arg 


Ala Val 


Tyr Ala Val Ala His Gly Leu 


385 










390 






395 400 


His 


Gin 


Leu 


Leu 


Gly 


Cys 


Ala 


Ser Gly 


Ala Cys Ser Arg Gly Arg Val 










405 








410 415 


Tyr 


Pro 


Trp 


Gin 


Leu 


Leu 


Glu 


Gin He 


His Lys Val His Phe Leu Leu 








420 








425 


430 


His 


Lys 


Asp 


Thr 


Val 


Ala 


Phe 


Asn Asp 


Asn Arg Asp Pro Leu Ser Ser 






435 










440 


445 


Tyr 


Asn 


He 


He 


Ala. 


Trp 


Asp 


Trp Asn 


Gly Pro Lys Trp Thr Phe Thr 




450 










455 




460 


Val 


Leu 


Gly 


Ser 


Ser 


Thr 


Trp 


Ser Pro 


Val Gin Leu Asn He Asn Glu 


465 










470 






475 480 


Thr 


Lys 


He 


Gin 


Trp 


His 


Gly 


Lys Asp 


Asn Gin Val Pro Lys Ser Val 










485 








490 495 


Cys 


Ser 


Ser 


Asp 


Cys 


Leu 


Glu 


Gly His 


Gin Arg Val Val Thr Gly Phe 








500 








505 


510 


His 


His 


Cys 


Cys 


Phe 


Glu 


Cys 


Val Pro 


Cys Gly Ala Gly Thr Phe Leu 






515 










520 


525 



5 
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Asn 


Lys 


Ser 


Asp 


Leu 


Tyr 


Arg 


Cys 




530 










535 




Ala 


Pro 


Glu 


Gly 


Ser 


Gin 


Thr 


Cys 


545 










550 






Ala 


Leu 


Arg 


Glu 


His 


Thr 


Ser 


Trp 










565 








Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Gly 


Thr 








580 










Asp 


Thr 


Pro 


Val 


Val 


Arg 


Ser 


Ala 






595 










600 


Leu 


Gly 


Ser 


Leu 


Ala 


Ala 


Gly 


Ser 




610 










615 




Glu 


Pro 


Thr 


Arg 


Pro 


Ala 


Cys 


Leu 


625 










630 






Gly 


Phe 


Thr 


He 


Phe 


Leu 


Ser 


Cys 










645 








lie 


lie 


lie 


Phe 


Lys 


Phe 


Ser 


Thr 








660 










Trp 


Val 


Gin 


Asn 


His 


Gly 


Ala 


Gly 






675 










680 


Ala 


Gin 


Leu 


Leu 


He 


Cys 


Leu 


Thr 




690 










695 




Pro 


Ala 


Arg 


Glu 


Tyr 


Gin 


Arg 


Phe 


705 










710 






Thr 


Glu 


Thr 


Asn 


Ser 


Leu 


Gly 


Phe 










725 








Leu 


Leu 


Ser 


He 


Ser 


Ala 


Phe 


Ala 








740 










Pro 


Glu 


Asn 


Tyr 


Asn 


Glu 


Ala 


Lys 






755 










760 


Asn 


Phe 


Val 


Ser 


Trp 


He 


Ala 


Phe 




770 










775 




Gly 


Lys 


Tyr 


Leu 


Pro 


Ala 


Ala 


Asn 


785 










790 






Ser 


Ser 


Gly 


Phe 


Gly 


Gly 


Tyr 


Phe 










805 








Cys 


Arg 


Pro 


Asp 


Leu 


Asn 


Ser 


Thr 








820 










Asp 


Tyr 


Thr 


Arg 


Arg 


Cys 


Gly 


Ser 






835 










840 



<210> 6 

<211> 839 

<212> PRT 

<213> Artificial Sequence 



bin 


Pro 


cys 


Giy 


Lys Glu 


ulU 


Trp 








D / ±\J 








pne 


pro 


TV 

Arg 


Thr 


vai vai 


pne 


Leu 






b 33 








jDU 


val 


Leu 


Leu 


Ala 


Ala Asn 


rnr 


Leu 




C *7 A 








b to 




Ala 


fO tt 

Gly 


Leu 


pne 


Ala Trp 


tr-5 a 
HIS 


Leu 


bob 








con 






Giy 


Giy 


TV 

Arg 


Leu 


cys pne 


Leu 


lyieu 










bUb 






Giy 


Ser 


Leu 


Tyr 


Giy pne 


pne 


Giy 








620 








Leu 


Arg 


Gin 


Ala 


Leu pne 


Aia 


Leu 














£ A A. 


Leu 


Thr 


val 


Arg 


Ser Phe 


Gin 


Leu 




C C A 

bo U 








cc c 

ODD 




iiys 


vai 


Pro 


inr 


Phe Tyr 


TJ-i a 
XllS 


Aia 


odd 








o / u 






Leu 


pne 


vai 


wec 


ne ocx 


Ser 


tv n a 

a j. a 










OOJ 






Trp 


T mi 

Leu 


vai 


vai 


irp inr 


Pro 


Leu 








»7 A A 
700 








Pro 


His 


Leu 


TT. «| 

Val 


Met Leu 


GlU 


Cys 






715 








720 


He 


Leu 


Ala 


Phe 


Leu Tyr 


Asn 


Gly 




730 








735 




Cys 


Ser 


Tyr 


Leu 


Gly Lys 


Asp 


Leu 


745 








750 






Cys 


Val 


Thr 


Phe 


Ser Leu 


Leu 


Pne 










765 






Phe 


Thr 


Thr 


Ala 


Ser Val 


Tyr 


Asp 








780 








Met 


Met 


Ala 


Gly 


Leu Ser 


Ser 


Leu 






795 








800 


Leu 


Pro 


Lys 


Cys 


Tyr Val 


lie 


Leu 




810 








815 




Glu 


His 


Phe 


Gin 


Ala Ser 


He 


Gin 


825 








830 







Thr 



<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 6 

. Met Gly Pro Arg Ala Lys Thr He Cys Ser Leu Phe Phe Leu Leu Trp 
1 5 10 15 

Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 

20 25 30 

Tyr Leu Leu Gly Gly Leu Phe Ser Leu His Ala Asn Met Lys Gly lie 



6 
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35 










A ft 


Val 


TT 1 

His 


Leu 


Asn 


Phe 


Leu 


Gin 


Val 




50 










55 




Lys 


Val 


He 


Gly 


Tyr 


Asn 


Leu 


Met 


65 










70 






Glu 


lie 


Asn 


Asn 


Asp 


Ser 


Ser 


Leu 










85 








Glu 


lie 


Val 


Asp 


Val 


Cys 


Tyr 


He 








100 










Tyr 


Phe 


Leu 


Ala 


His 


Glu 


Asp 


Asn 






115 










120 


Ser 


Asn 


Tyr 


He 


Ser 


Arg 


Val 


Val 




130 










135 




Glu 


Ser 


Val 


Met 


Thr 


Val 


Ala 


Asn 


145 










150 






Gin 


lie 


Thr 


Tyr 


Ser 


Ala 


He 


Ser 










165 








Phe 


Pro 


Ala 


Leu 


Leu 


Arg 


Thr 


Thr 








180 










Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 






195 










200 


Leu 


Val 


Ser 


Ser 


Asp 


Thr 


Tyr 


Gly 




210 










215 




Glu 


Arg 


Val 


Ala 


Arg 


Arg 


Asp 


He 


225 










230 






Pro 


Thr 


Leu 


Gin 


Pro 


Asn 


Gin 


Asn 










245 








Leu 


Val 


Thr 


He 


Val 


Asp 


Lys 


Leu 








260 










Val 


Val 


Phe 


Ser 


Pro 


Asp 


Leu 


Thr 






275 










280 


Leu 


Arg 


Gin 


Asn 


Phe 


Thr 


Gly 


Ala 




290 










295 




Ala 


lie 


Asp 


Pro 


Val 


Leu 


His 


Asn 


305 










310 






Thr 


Phe 


Leu 


Gly 


He 


Thr 


He 


Gin 










325 








Glu 


Phe 


Arg 


Glu 


Trp 


Gly 


Pro 


Gin 








340 










Thr 


Ser 


Gin 


Ser 


Tyr 


Thr 


Cys 


Asn 






355 










360 


Ala 


Thr 


Leu 


Ser 


Phe 


Asn 


Thr 


He 




370 










375 




Val 


Tyr 


Ser 


Val 


Tyr 


Ser 


Ala 


Val 


385 










390 






Ser 


Leu 


Leu 


Gly 


Cys 


Asp 


Lys 


Ser 










405 








Pro 


Trp 


Gin 


Leu 


Leu 


Glu 


Glu 


He 








420 










Asp 


His 


Gin 


He 


Phe 


Phe 


Asp 


Pro 






435 










440 


Glu 


lie 


Val 


Gin 


Trp 


Gin 


Trp 


Asp 




450 










455 




Val 


Ala 


Ser 


Tyr 


Tyr 


Pro 


Leu 


Gin 


465 










470 






lie 


Ser 


Trp 


His 


Thr 


Val 


Asn 


Asn 



485 



45 



XT J. V? 


Met- 
ric u 


Lyu iiys vjiu 


Tyr Glu 


Val 






ou 






m n 

ox 11 


Ala 




Ala Val 


Glu 






*7 c 




8ft 


Leu 


CL.KJ 


biy Veil JjclI 


ucu wiy 


lyr 




on 








OCX. 


I\an 
nail 


7\ an Val filn 
Aou Val vjrm 


Pro Val 


Leu 


"i ft c 
105 






110 




Leu 


Leu 


pro lie Gin 


fST 1 1 TV en 
GlU ASp 


Tyr 






IOC 
1^3 






zn a 

iUa 


Val 


Tin rsl ir Drn 
11c wiy IriO 


7V qt) TV on 
nop Moll 


OCX 






t A ft 

140 






DVia 
Jrlie 


Leu 


QCL LicU Jrllc 


T.on Ti<aii 
JjcU UCU 


Pro 






155 




T Cft 

loQ 


ASp 


G1U 


Leu Arg Asp 


juys vai 


Arg 




170 




175 




Pro 


ser 


Aia Asp HIS 


his vai 


r*l 11 
GIU 


185 






190 




Pne 


Arg 


Trp Asn Trp 


Tl _ Tin 

lie lie 


Val 






205 






Arg 


Asp 


Asn Gly Gin 


Leu Leu 


Giy 






220 






cys 


lie 


Aia Fne Gin 


giu inr 


Leu 






O "3 c 

235 






Mcl 


inr 


be! GlU lilU 




Arg 




T r a 

2, DO 




0 c c 




/"I ~l _ 

um 


um 


ber inr Aia 


Arg vai 


vai 


1 c c 






*5 *7 rt 
48 / 0 




Leu 


Tyr 


his irne irne 


Asn GlU 


Val 






IOC 






vai 


Trp 


Tl a fil a Car* 

lie Aia ber 


Glu Ser 


Trp 






300 






Leu 


xnr 


giu i*eu Giy 


hi s iieu 


Giy 






315 




320 


Ser 


Val 


Pro He Pro 


Gly Phe 


Ser 




330 




335 




Aia 


Giy , 


riO rTO riO 


Leu Ser 


Arg 


345 






350 




Gin 


GlU 


Cys Asp Asn 


Cys Leu 


T\ 

Asn 






365 






Leu 


Arg 


Leu Ser Gly 


Glu Arg 


■» T_ T 

vai 






360 






Tyr 


Ala 


vai Aia his 


Ala Leu 


TT J _ 

HIS 






395 




A ft. ft, 

4 00 


lnr 


Cys 


inr i*ys Arg 


Va 1 Va 1 
Val Val 


Tyr 




A 1 ft 

410 




A T C 
413 




Trp 


Lys 


vai ASn Fne 


1 iiX lieu 


Leu 


425 






430 




Gin 


Gly 


Asp Val Ala 


Leu His 


Leu 






445 






Arg 


Ser 


Gin Asn Pro 


Phe Gin 


Ser 






460 






Arg 


Gin 


Leu Lys Asn 


He Gin 


Asp 






475 




480 


Thr 


He 


Pro Met Ser 


Met Cys 


Ser 




490 




495 
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Lys 


Arg 


Cys 


Gin 


Ser 


Gly 


Gin 


Lys 








500 














XT lie 


Gl ii 


v»yo 


Tl e 
xxe 


A cn 
no^ 








5i 5 










520 


111! 




Asp 


ulU 


Tyr 


uiu 


v*y o 


Gl Tl 
wXll 














5*3 5 




lyx 


gi n 


Ser 


gi ii 


1I1X 


Cor 

OCX 


wye 




54 5 
















irp 


nlS 




Ala 


Pro 


Tnr 


Tl e 
xxe 


Til a 

AXd 














i 




rile 


Leu 


Ser 


lilx 


Leu 


Ala 


Tl e 

xxe 


Leu 








con 
O o U 










inr 


Pro 


lie 


Val 


Arg 


OCI 


7V 1 a 
Ala 


•criy 






D jD 










son 


Tnr 


Leu 


T cm 

Leu 


T All 

Leu 


vai 


Aia 


Tyr 






O 1U 










61 5 
OIj 




pro 


Lys 


vaj. 


ser 


Thr 


Cys 


Leu 


Cys 














■ 




pne 


rpl, — 
XXIX 


lie 


Cys 


lie 


Ser 


Cys 


Tlo 
lie 










645 








Cys 


Til a 
Axel 


Dho 


Lys 




Ala 


Cor 
oex 


Arg 








DDU 










V a 1 


Arg 


Tyr 


viin 


uiy 


Pro 


Tyr 


Va 1 
val 






p / D 










oou 


Lys 


nee 


Va 1 

vai 


lie 


vai 


Val 

vai 


Tl <=» 
lie 


nl \r 
uiy 


















rpV-i -v 

TfllT 


Thr 


Arg 


Tnr 


Asp 


Pro 


Asp 


Asp 


705 










/ 10 






TV nM 

Asn 


Pro 


AS11 


Tyr 


TV 

Arg 


Asn 


ber 


Leu 










TOR 








Leu 


Leu 


Ser 


Va 1 
Val 


Val 


Gl 

wiy 


XT IlC 


Ser 








7A ft. 










Pro 


mr 


Asn 


Tyr 


Asn 


UlU 


7Al a 
Ala 


Lys 






755 










/ O U 


Tyr 


file 


inr 


Ser 


Ser 


Va 1 
Val 


Ser 


Leu 




770 










775 




Gly 


Val 


Leu 


Val 


Thr 


He 


Val 


Asp 


785 










790 






Leu 


Ala 


He 


Ser 


Leu 


Gly 


Tyr 


Phe 










805 








Phe 


Tyr 


Pro 


Glu 


Arg 


Asn 


Thr 


Pro 








820 










Gly 


Tyr 


Thr 


Met 


Arg 


Arg 


Asp 





835 



Lys 


Lys 


Pro 


Val 


Gly lie 


His 


Val 


505 








510 






Leu 


Pro 


Glv 

vjxy 


Thr 


Phe Leu 


Asn 


His 










525 






Ala 


Cvs 


Pro 


Asn 


Asn Glu 


Tro 


Ser 








540 








Lys 


Ara 


Gin 


Leu 


Val Phe 


Leu 


Glu 






555 








560 


Val 


Ala 


Leu 


Leu 


Ala Ala 

-ni a rtia 


Leu 


Glv 
vjxy 




570 








575 




Val 


lie 

lie 


Phe 
±r lie 


lip 


4nl y xix a 


Phe 
it xie 


Gin 

VJtXll 


585 








590 






Glv 
vjxy 


Pro 

XrX VJ 


Meh 


v—y o 


rue ueu 


Met* 
i*ie u 


ueu 










605 






Val 
Val 


Va 1 
Val 


Dr/v 
xrlU 


Val 
Val 


Txrr Va 1 
lyx Val 


Hi v 
vjiy 


Pro 








620 








Arg 


m n 

UlU 


Ivl a 
Axa 


Leu 


Phe Pro 


xieu. 


Cys 






fi"*5 

DO 9 










al a 

Ala 


Val 
Val 


Arg 


Ser 


Phe Gin 


Tl e 
xxe 


Val 
val 




650 








655 




Phe 


Xr x 


Arg 


Ala 


Tyr Ser 


lyx 


ixp 


665 








670 






oex 


1*1 e 


Ala 


Phe lie Thr 


Val 

vai 


Tien 
xieu 










685 






Mpt- 

rleu 


T>oii 


Al a 
nla 


Thr Gly Leu 


oex 


XT 1 W 








700 








Dm 
rl vj 


U y a 


Tl e 
xxe 


Thr 


He Val 


Car 

oex 


v*y o 






71 5 

/ 13 








75 0 


JjcU 


Ir XlC 


7A an 
noil 


Thr 


Ser Leu 


7A on 


Leu 




7^0 

/ J VJ 








7"? 5 




Phe 

Ir ue 


711 a 
ilia 


lyr 


Met Gly Lys 


VJX LI 


Tien 
liC u 


745 








750 






PVip 

IT XlC 


Tl e 
xxe 


lilx 


Leu 


Ser Met 


lilx 


rjVip 
Jrile 










765 






Cys 


inr 




Met 


Ser Ala 


Tyr 


Ser 








780 








Leu 


Leu 


Val 


Thr Val Leu 


Asn 


Leu 






795 








800 


Gly 


Pro 


Lys 


Cys Tyr Met 


He 


Leu 




810 








815 




Ala 


Tyr 


Phe 


Asn 


Ser Met 


He 


Gin 


825 








830 







<210> 7 
<211> 852 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 

<400> 7 

Met Leu Gly Pro Ala Val Leu Gly Leu Ser Leu Trp Ala Leu Leu His 
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15 10 15 

Pro Gly Thr Gly Ala Pro Leu Cys Leu Ser Gin Gin Leu Arg Met Lys 

20 25 30 

Gly Asp Tyr Val Leu Gly Gly Leu Phe Pro Leu Gly Glu Ala Glu Glu 

35 40 45 

Ala Gly Leu Arg Ser Arg Thr Arg Pro Ser Ser Pro Val Cys Thr Arg 

50 55 SO 

Phe Ser Ser Asn Gly Leu Leu Trp Ala Leu Ala Met Lys Met Ala Val 
65 70 75 80 

Glu Glu He Asn Asn Lys Ser Asp Leu Leu Pro Gly Leu Arg Leu Gly 

85 90 95 

Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Val Val Ala Met Lys Pro 

100 105 HO 

Ser Leu Met Phe Leu Ala Lys Ala Gly Ser Arg Asp He Ala Ala Tyr 

115 120 ~ 125 

Cys Asn Tyr Thr Gin Tyr Gin Pro Arg Val Leu Ala Val He Gly Pro 

130 135 140 

His Ser Ser Glu Leu Ala Met Val Thr Gly Lys Phe Phe Ser Phe Phe 
145 150 155 160 

Leu Met Pro Gin Val Ser Tyr Gly Ala Ser Met Glu Leu Leii Ser Ala 

165 170 175 

Arg Glu Thr Phe Pro Ser Phe Phe Arg Thr Val Pro Ser Asp Arg Val 

180 185 190 

Gin Leu Thr Ala Ala Ala Glu Leu Leu Gin Glu Phe Gly Trp Asn Trp 

195 200 205 

Val Ala Ala Leu Gly Ser Asp Asp Glu Tyr Gly Arg Gin Gly Leu Ser 

210 215 220 

He Phe Ser Ala Leu Ala Ala Ala Arg Gly He Cys He Ala His Glu 
225 230 235 240 

Gly Leu Val Pro Leu Pro Arg Ala Asp Asp Ser Arg Leu Gly Lys Val 

245 250 " 255 

Gin Asp Val Leu His Gin Val Asn Gin Ser Ser Val Gin Val Val Leu 

260 265 270 

Leu Phe Ala Ser Val His Ala Ala His Ala Leu Phe Asn Tyr Ser He 

275 280 285 

Ser Ser Arg Leu Ser Pro Lys Val Trp Val Ala Ser Glu Ala Trp Leu 

290 295 300 

Thr Ser Asp Leu Val Met Gly Leu Pro Gly Met Ala Gin Met Gly Thr 
305 310 315 320 

Val Leu Gly Phe Leu Gin Arg Gly Ala Gin Leu His Glu Phe Pro Gin 

325 330 335 

Tyr Val Lys Thr His Leu Ala Leu Ala Thr Asp Pro Ala Phe Cys Ser 

340 345 350 

Ala Leu Gly Glu Arg Glu Gin Gly Leu Glu Glu Asp Val Val Gly Gin 

355 360 " 365 

Arg Cys Pro Gin Cys Asp Cys He Thr Leu Gin Asn Val Ser Ala Gly 

370 375 380 

Leu Asn His His Gin Thr Phe Ser Val Tyr Ala Ala Val Tyr Ser Val 
385 390 395 400 

Ala Gin Ala Leu His Asn Thr Leu Gin Cys Asn Ala Ser Gly Cys Pro 

405 410 415 

Ala Gin Asp Pro Val Lys Pro Trp Gin Leu Leu Glu Asn Met Tyr Asn 

420 425 430 

Leu Thr Phe His Val Gly Gly Leu Pro Leu Arg Phe Asp Ser Ser Gly 

435 440 445 

Asn Val Asp Met Glu Tyr Asp Leu Lys Leu Trp Val Trp Gin Gly Ser 
450 455 ~ 460 
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vai 


Pro 


Arg 


Leu 


HIS 


IV i-i 

ASp 


vai 


Giy 


495 
















Glu 


Arg 


Leu 


Lys 


lie 


Arg 


Trp 


nlo 










A O C 

485 








Ser 


Arg 


Cys 


Ser 


Arg 


Gin 


Cys 


Gin 








e a a 

500 










Gly 


Phe 


His 


Ser 


Cys 


Cys 


Tyr 


Asp 






515 










520 


Tyr 


Arg 


Gin 


Asn 


Pro 


TV M«K 

ASp 


ABp 


lie 




CIA 

530 










5 j5 




Glu 


Trp 


Ser 


pro 


GIU 


Arg 


Ser 


inr 


tZA R 










DD\J 






pne 


Lieu 


TV 1 = 

Ala 


Trp 


Giy 


GIU 


Pro 


Ala 


















ser 


Leu 


tv i 

Ala 


T mi 

Leu 


Giy 


Lieu 


vai 


L6U 








con 
580 










HIS 


Arg 


Asp 


Ser 


Pro 


Leu 


Val 


Gin 






595 










600 


Phe 


Gly 


Leu 


Val 


Cys 


Leu 


Gly 


Leu 




610 










615 




Pro 


Gly 


Gin 


Pro 


Ser 


Pro 


Ala 


Arg 


625 










630 






His 


Leu 


Pro 


Leu 


Thr 


Gly 


Cys 


Leu 










645 








Glu 


lie 


Phe 


Val 


Glu 


Ser 


Glu 


Leu 








e e a 
660 










Ser 


Gly 


Cys 


Leu 


Arg 


Gly 


Pro 


Trp 






675 










680 


Met 


Leu 


Val 


Glu 


Val 


Ala 


Leu 


Cys 




690 










695 




Pro 


Glu 


Val 


Val 


Thr 


Asp 


Trp 


HIS 


T A C 

705 










710 






HIS 


Cys 


Arg 


Thr 


Arg 


Ser 


Trp 


vai 










/ ZD 








Asn 


Ala 


ml, v 

Txir 


Leu 


TV 1 a 

Ala 


pne 


Leu 


Cys 








^ a n 










Ser 


Gin 


pro 


Gly 


TV 

Arg 


Tyr 


Asn 


Arg 






755 










•7 C A 

760 


Leu 


Ala 


Tyr 


Phe 


lie 


Thr 


Trp 


val 




770 










775 




Val 


Gin 


Val 


Val 


Leu 


Arg 


Pro 


Ala 


















Cys 


Val 


Leu 


Gly 


He 


Leu 


Ala 


Ala 










805 








Leu 


Met 


Arg 


Gin 


Pro 


Gly 


Leu 


Asn 








820 










Gly 


Pro 


Gly 


Asp 


Ala 


Gin 


Gly 


Gin 






835 










840 


Gly 


Lys 


His 


Glu 











850 



<210> 8 

<211> 2526 

<212> DNA 

<213> Artificial Sequence 
<220> 



Arg 


IT IlC 


Asn 




T,on 7A.T"rr 


Thr 






475 






480 


LUX. 


OCX 


7A on 
nop 


7A an dl n 


T.ira P>*rt 
uyo x^i-w 


Val 




490 






495 




GIU 


oiy 


Gin 


vai Arg 




T.irfl 

Jjyo 


505 








510 




Cys 


vai 


Asp 


Cys Glu 


Ala Giy 


Ser 








5^5 5 






Aia 


Cys 


mr 


Phe Cys 


Giy Gin 


Asp 








j1 U 






Arg 


Cys 


ir lie 


TV TV irry 

Arg Arg 


Arg Ser 


Arg 






c: c= c 
-j j j 






^ fin 

sou 


Vdl 


Leu 


JJCU 


JJCU JjcU 


jjcu. ucu 


Leu 




Zj I V 






ETC 
D f O 




Aia 


TV "1 a 

Aia 


Leu 


vjxy JjcU 


rUc val 


TTJ o 
XI IB 


585 








3 J U 




TV 1 -j 

Aia 


faer 


Giy 


Giy pro 


Lieu Aia 


Cys 














Val 


Cys 


Leu 


Ser Val 


Leu Leu 


Pne 








A 

620 






Cys 


Leu 


Ala 


Gin Gin 


Pro Leu 


Ser 






b J5 






£ A A 
O40 


Ser 


Thr 


Leu 


Pne Leu 


Gin Aia 


TV T _ 

Ala 




<r c a 
650 






655 




Pro 


Leu 


Ser 


Trp Ala 


Asp Arg 


Leu 


665 








£1 A 

670 




TV 1 <m 

Ala 


Trp 


Leu 


Val Val 


Leu Leu 


TV T « 

Ala 








o85 






Thr 


Trp 


Tyr 


Leu Val 


Ala Phe 


Pro 








T A A 

700 






Met 


Leu 


Pro 


Txir Glu 


TV T -» T n>* 

Aia Leu 


vai 






715 






*70 A 

7 Z 0 


oer 


pne 


Giy 


Leu Ala 


TT J — TV "1 a 

nlS Aia 


Thr 




/ J U 






•70 C 

/JO 




pne 


Leu 


Giy 


inr pne 


JjcU Val 


Arg 


/ *t D 












Aia 


Arg 


Giy 


Lieu inr 


nViQ TVl a 

pne Aia 










zoo 






Ser 


pne 


vai 


Pro Leu 


Leu Ala 


Asn 








/ o u 






Val 


Gin 


Met 


Gly Ala 


Leu Leu 


Leu 






795 






800 


Phe 


His 


Leu 


Pro Arg 


Cys Tyr 


Leu 




810 






815 




Thr 


Pro 


Glu 


Phe Phe 


Leu Gly 


Gly 


825 








830 




Asn 


Asp 


Gly 


Asn Thr 


Gly Asn 


Gin 



845 



10 



WO 2005/015158 



PCT/US2004/025459 



<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 8 

atgctgctct gcacggctcg cctggtcggc ctgcagcttc tcatttcctg ctgctgggcc 60 

tttgcctgcc atagcacgga gtcttctcct gacttcaccc tccccggaga ttacctcctg 120 

gcaggcctgt tccctctcca ttctggctgt ctgcaggtga ggcacagacc cgaggtgacc 180 

ctgtgtgaca ggtcttgtag cttcaatgag catggctacc acctcttcca ggctatgcgg 240 

cttggggttcj aggagataaa caactccacg gccctgctgc ccaacatcac cctggggtac 300 

cagctgtatg atgtgtgttc tgactctgcc aatgtgtatg ccacgctgag agtgctctcc 360 

ctgccagggc aacaccacat agagctccaa ggagaccttc tccactattc ccctacggtg 42 0 

ctggcagtga ttgggcctga cagcaccaac cgtgctgcca ccacagccgc cctgctgagc 480 

cctttcctgg tgcccatgat tagctatgcg gccagcagcg agacgctcag cgtgaagcgg 540 

cagtatccct ctttcctgcg caccatcccc aatgacaagt accaggtgga gaccatggtg 600 

ctgctgctgc agaagttcgg gtggacctgg atctctctgg ttggcagcag tgacgactat 660 

gggcagctag gggtgcaggc actggagaac caggccactg gtcaggggat ctgcattgct 720 

ttcaaggaca tcatgccctt ctctgcccag gtgggcgatg agaggatgca gtgcctcatg 780 

cgccacctgg cccaggccgg ggccaccgtc gtggttgttt tttccagccg gcagttggcc 84 0 

agggtgtttt tcgagtccgt ggtgctgacc aacctgactg gcaaggtgtg ggtcgcctca 900 

gaagcctggg ccctctccag gcacatcact ggggtgcccg ggatccagcg cattgggatg 960 

gtgctgggcg tggccatcca gaagagggct gtccctggcc tgaaggcgtt tgaagaagcc 1020 

tatgcccggg cagacaagaa ggcccctagg ccttgccaca agggctcctg gtgcagcagc 1080 

aatcagctct gcagagaatg ccaagctttc atggcacaca cgatgcccaa gctcaaagcc 1140 

ttctccatga gttctgccta caacgcatac cgggctgtgt atgcggtggc ccatggcctc 1200 

caccagctcc tgggctgtgc ctctggagct tgttccaggg gccgagtcta cccctggcag 1260 

cttttggagc agatccacaa ggtgcatttc cttctacaca aggacactgt ggcgtttaat 1320 

gacaacagag atcccctcag tagctataac ataattgcct gggactggaa tggacccaag 1380 

tggaccttca cggtcctcgg ttcctccaca tggtctccag ttcagctaaa cataaatgag 1440 

accaaaatcc agtggcacgg aaaggacaac caggtgccta agtctgtgtg ttccagcgac 1500 

tgtcttgaag ggcaccagcg agtggttacg ggtttccatc actgctgctt tgagtgtgtg 1560 

ccctgtgggg ctgggacctt cctcaacaag agtgacctct acagatgcca gccttgtggg 1620 

aaagaagagt gggcacctga gggaagccag acctgcttcc cgcgcactgt ggtgtttttg 1680 

gctttgcgtg agcacacctc ttgggtgctg ctggcagcta acacgctgct gctgctgctg 1740 

ctgcttggga ctgctggcct gtttgcctgg cacctagaca cccctgtggt gaggtcagca 1800 

gggggccgcc tgtgctttct tatgctgggc tccctggcag caggtagtgg cagcctctat 1860 

ggcttctttg gggaacccac aaggcctgcg tgcttgctac gccaggccct ctttgccctt 192 0 

ggtttcacca tcttcctgtc ctgcctgaca gttcgctcat tccaactaat catcatcttc 1980 

aagttttcca ccaaggtacc tacattctac cacgcctggg tccaaaacca cggtgctggc 2040 

ctgtttgtga tgatcagctc agcggcccag ctgcttatct gtctaacttg gctggtggtg 2100 

tggaccccac tgcctgctag ggaataccag cgcttccccc atctggtgat gcttgagtgc 2160 

acagagacca actccctggg cttcatactg gccttcctct acaatggcct cctctccatc 2220 

agtgcctttg cctgcagcta cctgggtaag gacttgccag agaactacaa cgaggccaaa 2280 

tgtgtcacct tcagcctgct cttcaacttc gtgtcctgga tcgccttctt caccacggcc 2340 

agcgtctacg acggcaagta cctgcctgcg gccaacatga tggctgggct gagcagcctg 2400 

agcagcggct tcggtgggta ttttctgcct aagtgctacg tgatcctctg ccgcccagac 2460 

ctcaacagca cagagcactt ccaggcctcc attcaggact acacgaggcg ctgcggctcc 2520 

acctga 2526 

<210> 9 
<211> 2559 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note o 
synthetic construct 
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<400> 9 

atgctgggcc ctgctgtcct gggcctcagc ctctgggctc tcctgcaccc tgggacgggg 60 

gccccattgt gcctgtcaca gcaacttagg atgaaggggg actacgtgct gggggggctg 120 

ttccccctgg gcgaggccga ggaggctggc ctccgcagcc ggacacggcc cagcagccct 180 

gtgtgcacca ggttctcctc aaacggcctg ctctgggcac tggccatgaa aatggccgtg 240 

gaggagatca acaacaagtc ggatctgctg cccgggctgc gcctgggcta cgacctcttt 300 

gatacgtgct cggagcctgt ggtggccatg aagcccagcc tcatgttcct ggccaaggca 360 

ggcagccgcg acatcgccgc ctactgcaac tacacgcagt accagccccg tgtgctggct 420 

gtcatcgggc cccactcgtc agagctcgcc atggtcaccg gcaagttctt cagcttcttc 480 

ctcatgcccc aggtcagcta cggtgctagc atggagctgc tgagcgcccg ggagaccttc 540 

ccctccttct tccgcaccgt gcccagcgac cgtgtgcagc tgacggccgc cgcggagctg 600 

ctgcaggagt tcggctggaa ctgggtggcc gccctgggca gcgacgacga gtacggccgg 660 

cagggcctga gcatcttctc ggccctggcc gcggcacgcg gcatctgcat cgcgcacgag 720 

ggcctggtgc cgctgccccg tgccgatgac tcgcggctgg ggaaggtgca ggacgtcctg 780 

caccaggtga accagagcag cgtgcaggtg gtgctgctgt tcgcctccgt gcacgccgcc 840 

cacgccctct tcaactacag catcagcagc aggctctcgc ccaaggtgtg ggtggccagc 900 

gaggcctggc tgacctctga cctggtcatg gggctgcccg gcatggccca gatgggcacg 960 

gtgcttggct tcctccagag gggtgcccag ctgcacgagt tcccccagta cgtgaagacg 102 0 

cacctggccc tggccaccga cccggccttc. tgctctgccc tgggcgagag ggagcagggt 1080 

ctggaggagg acgtggtggg ccagcgctgc ccgcagtgtg actgcatcac gctgcagaac 1140 

gtgagcgcag ggctaaatca ccaccagacg ttctctgtct acgcagctgt gtatagcgtg 1200 

gcccaggccc tgcacaacac tcttcagtgc aacgcctcag gctgccccgc gcaggacccc 1260 

gtgaagccct ggcagctcct ggagaacatg tacaacctga ccttccacgt gggcgggctg 1320 

ccgctgcggt tcgacagcag cggaaacgtg gacatggagt acgacctgaa gctgtgggtg 1380 

tggcagggct cagtgcccag gctccacgac gtgggcaggt tcaacggcag cctcaggaca 1440 

gagcgcctga agatccgctg gcacacgtct gacaaccaga agcccgtgtc ccggtgctcg 1500 

cggcagtgcc aggagggcca ggtgcgccgg gtcaaggggt tccactcctg ctgctacgac 1560 

tgtgtggact gcgaggcggg cagctaccgg caaaacccag acgacatcgc ctgcaccttt 1620 

tgtggccagg atgagtggtc cccggagcga agcacacgct gcttccgccg caggtctcgg 1680 

ttcctggcat ggggcgagcc ggctgtgctg ctgctgctcc tgctgctgag cctggcgctg 1740 

ggccttgtgc tggctgcttt ggggctgttc gttcaccatc gggacagccc actggttcag 1800 

gcctcggggg ggcccctggc ctgctttggc ctggtgtgcc tgggcctggt ctgcctcagc 1860 

gtcctcctgt tccctggcca gcccagccct gcccgatgcc tggcccagca gcccttgtcc 1920 

cacctcccgc tcacgggctg cctgagcaca ctcttcctgc aggcggccga gatcttcgtg 1980 

gagtcagaac tgcctctgag ctgggcagac cggctgagtg gctgcctgcg ggggccctgg 2040. 

gcctggctgg tggtgctgct ggccatgctg gtggaggtcg cactgtgcac ctggtacctg 2100 

gtggccttcc cgccggaggt ggtgacggac tggcacatgc tgcccacgga ggcgctggtg 2160 

cactgccgca cacgctcctg ggtcagcttc ggcctagcgc acgccaccaa tgccacgctg 2220 

gcctttctct gcttcctggg cactttcctg gtgcggagcc agccgggctg ctacaaccgt 2280 

gcccgtggcc tcacctttgc catgctggcc tacttcatca cctgggtctc ctttgtgccc 2340 

ctcctggcca atgtgcaggt ggtcctcagg cccgccgtgc agatgggcgc cctcctgctc 2400 

tgtgtcctgg gcatcctggc tgccttccac ctgcccaggt gttacctgct catgcggcag 2460 

ccagggctca acacccccga gttcttcctg ggagggggcc ctggggatgc ccaaggccag 2520 

aatgacggga acacaggaaa tcaggggaaa catgagtga 2559 

<210> 10 
<211> 2518 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note o 
synthetic construct 

<400> 10 

atggggccca gggcaaagac catctgctcc ctgttcttcc tcctatgggt cctggctgag 60 

ccggctgaga actcggactt ctacctgcct ggggattacc tcctgggtgg cctcttctcc 120 
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ctccatgcca acatgaaggg cattgttcac cttaacttcc tgcaggtgcc catgtgcaag 180 

gagtatgaag tgaaggtgat aggctacaac ctcatgcagg ccatgcgctt cgcggtggag 240 

gagatcaaca atgacagcag cctgctgcct ggtgtgctgc tgggctatga gatcgtggat 300 

gtgtgctaca tctccaacaa tgtccagccg gtgctctact tcctggcaca cgaggacaac 360 
ctccttccca tccaagagga ctacagtaac tacatttccc gtgtggtggc tgtcattggc . 420 

cctgacaact ccgagtctgt catgactgtg gccaattcct ctccctattt ctccttccac 480 

agatcaccta cagcgccatc agcgatgagc tgcgagacaa ggtgcgcttc ccggctttgc 540 

tgcgtaccac acccagcgcc gaccaccacg tcgaggccat ggtgcagctg atgctgcact 600 

tccgctggaa ctggatcatt gtgctggtga gcagcgacac ctatggccgc gacaatggca 660 

gctgcttggc gagcgcgtgg cccggcgcga catctgcatc gccttccagg agacgctgcc 720 

cacactgcag cccaaccaga acatgacgtc agaggagcgc cagcgcctgg tgaccattgt 780 

ggacaagctg cagcagagca cagcgcgcgt cgtggtcgtg ttctcgcccg acctgaccct 840 

gtaccacttc ttcaatgagg tgctgcgcca gaacttcacg ggcgccgtgt ggatcgcctc 900 

cgagtcctgg gccatcgacc cggtcctgca caacctcacg gagctgggcc acttgggcac 960 

cttcctgggc atcaccatcc agagcgtgcc catcccgggc ttcagtgagt tccgcgagtg 1020 

gggcccacag gctgggccgc cacccctcag caggaccagc cagagctata cctgcaacca 1080 

ggagtgcgac aactgcctga acgccacctt gtccttcaac accattctca ggctctctgg 1140 

ggagcgtgtc gtctacagcg tgtactctgc ggtctatgct gtggcccatg ccctgcacag 1200 

cctcctcggc tgtgacaaaa gcacctgcac caagagggtg gtctacccct ggcagctgct 1260 

tgaggagatc tggaaggtca acttcactct cctggaccac caaatcttct tcgacccgca 1320 

aggggacgtg gctctgcact tggagattgt ccagtggcaa tgggaccgga gccagaatcc 1380 

cttccagagc gtcgcctcct actaccccct gcagcgacag ctgaagaaca tccaagacat 1440 

ctcctggcac accgtcaaca acacgatccc tatgtccatg tgttccaaga ggtgccagtc 1500 

agggcaaaag aagaagcctg tgggcatcca cgtctgctgc ttcgagtgca tcgactgcct 1560 

tcccggcacc ttcctcaacc acactgaaga tgaatatgaa tgccaggcct gcccgaataa 1620 

cgagtggtcc taccagagtg agacctcctg cttcaagcgg cagctggtct tcctggaatg 1680 

gcatgaggca cccaccatcg ctgtggccct gctggccgcc ctgggcttcc tcagcaccct 1740 

ggccatcctg gtgatattct ggaggcactt ccagacaccc atagttcgct cggctggggg 1800 

ccccatgtgc ttcctgatgc tgacactgct gctggtggca tacatggtgg tcccggtgta 1860 

cgtggggccg cccaaggtct ccacctgcct ctgccgccag gccctctttc ccctctgctt 1920 

cacaatttgc atctcctgta tcgccgtgcg ttctttccag atcgtctgcg ccttcaagat 1980 

ggccagccgc ttcccacgcg cctacagcta ctgggtccgc taccaggggc cctacgtctc 2040 

tatggcattt atcacggtac tcaaaatggt cattgtggta attggcatgc tggccacggg 2100 

cctcagtccc accacccgta ctgaccccga tgaccccaag atcacaattg tctcctgtaa 2160 

ccccaactac cgcaacagcc tgctgttcaa caccagcctg gacctgctgc tctcagtggt 2220 

gggtttcagc ttcgcctaca tgggcaaaga gctgcccacc aactacaacg aggccaagtt 2280 

catcaccctc agcatgacct tctatttcac ctcatccgtc tccctctgca ccttcatgtc 2340 

tgcctacagc ggggtgctgg tcaccatcgt ggacctcttg gtcactgtgc tcaacctcct 2400 

ggccatcagc ctgggctact tcggccccaa gtgctacatg atcctcttct acccggagcg 2460 

caacacgccc gcctacttca acagcatgat ccagggctac accatgagga gggactag 2518 

<210> 11 
<211> 2577 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note ■ 
synthetic construct 

<400> 11 

atgccgggtt tggctatctt gggcctcagt ctggctgctt tcctggagct tgggatgggg. 60 

tcctctttgt gtctgtcaca gcaattcaag gcacaagggg actatatatt gggtggacta 120 

tttcccctgg gcacaactga ggaggccact ctcaaccaga gaacacagcc caacggcatc 180 

ctatgtacca ggttctcgcc ccttggtttg ttcctggcca tggctatgaa gatggctgta 240 

gaggagatca acaatggatc tgccttgctc cctgggctgc gactgggcta tgacctgttt 300 

gacacatgct cagagccagt ggtcaccatg aagcccagcc tcatgttcat -ggccaaggtg 360 
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ggaagtcaaa gcattgctgc ctactgcaac tacacacagt accaaccccg tgtgctggct 420 

gtcattggtc cccactcatc agagcttgcc ctcattacag gcaagttctt cagcttcttc 480 

ctcatgccac aggtcagcta tagtgccagc atggatcggc taagtgaccg ggaaacattt 540 

ccatccttct tccgcacagt gcccagtgac cgggtgcagc tgcaggccgt tgtgacactg 600 

ttgcagaatt tcagctggaa ctgggtggct gccttaggta gtgatgatga ctatggccgg 660 

gaaggtctga gcatcttttc tggtctggcc aactcacgag gtatctgcat tgcacacgag 720 

ggcctggtgc cacaacatga cactagtggc caacaattgg gcaaggtggt ggatgtgcta 780 

cgccaagtga accaaagcaa agtacaggtg gtggtgctgt ttgcatctgc ccgtgctgtc 840 

tactcccttt ttagctacag catccttcat gacctctcac ccaaggtatg ggtggccagt 900 

gagtcctggc tgacctctga cctggtcatg acacttccca atattgcccg tgtgggcact 960 

gttcttgggt ttctgcagcg cggtgcccta ctgcctgaat tttcccatta tgtggagact 1020 

cgccttgccc tagctgctga cccaacattc tgtgcctccc tgaaagctga gttggatctg 1080 

gaggagcgcg tgatggggcc acgctgttca caatgtgact acatcatgct acagaacctg 1140 

tcatctgggc tgatgcagaa cctatcagct gggcagttgc accaccaaat atttgcaacc 1200 

tatgcagctg tgtacagtgt ggctcaggcc cttcacaaca ccctgcagtg caatgtctca 1260 

cattgccaca catcagagcc tgttcaaccc tggcagctcc tggagaacat gtacaatatg 1320 

agtttccgtg ctcgagactt gacactgcag tttgatgcca aagggagtgt agacatggaa 1380 

tatgacctga agatgtgggt gtggcagagc cctacacctg tactacatac tgtaggcacc 1440 

ttcaacggca cccttcagct gcagcactcg aaaatgtatt ggccaggcaa ccaggtgcca 1500 

gtctcccagt gctcccggca gtgcaaagat ggccaggtgc gcagagtaaa gggctttcat 1560 

tcctgctgct atgactgtgt ggactgcaag gcagggagct accggaagca tccagatgac 1620 

ttcacctgta ctccatgtgg caaggatcag tggtccccag aaaaaagcac aacctgctta 1680 

cctcgcaggc ccaagtttct ggcttggggg gagccagctg tgctgtcact tctcctgctg 1740 

ctttgcctgg tgctgggcct gacactggct gccctggggc tctttgtcca ctactgggac 1800 

agccctcttg ttcaggcctc aggtgggtca ctgttctgct ttggcctgat ctgcctaggc 1860 

ctcttctgcc tcagtgtcct tctgttccca ggacgaccac gctctgccag ctgccttgcc 1920 

caacaaccaa tggctcacct ccctctcaca ggctgcctga gcacactctt cctgcaagca 1980 

gccgagatct ttgtggagtc tgagctgcca ctgagttggg caaactggct ctgcagctac 204 0 

cttcggggcc cctgggcttg gctggtggta ctgctggcca ctcttgtgga ggctgcacta 2100 

tgtgcctggt acttgatggc tttccctcca gaggtggtga cagattggca ggtgctgccc 2160 

acggaggtac tggaacactg ccgcatgcgt tcctgggtca gcctgggctt ggtgcacatc 2220 

accaatgcag tgttagcttt cctctgcttt ctgggcactt tcctggtaca gagccagcct 2280 

ggtcgctata accgtgcccg tggcctcacc ttcgccatgc tagcttattt catcatctgg 2340 

gtctcttttg tgcccctcct ggctaatgtg caggtggcct accagccagc tgtgcagatg 2400 

ggtgctatct tattctgtgc cctgggcatc ctggccacct tccacctgcc caaatgctat 2460 

gtacttctgt ggctgccaga gctcaacacc caggagttct tcctgggaag gagccccaag 2520 

gaagcatcag atgggaatag tggtagtagt gaggcaactc ggggacacag tgaatga 2577 

<210> 12 j 
<211> 137 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 12 



Pro Ser Pro Phe 


Arg Asp He 


Val 


Ser 


Tyr 


Pro 


Asp Lys He He Leu 


1 


5 






10 




15 


Gly Cys Phe Met 


Asn Leu Lys 


Thr 


Ser 


Ser 


Val 


Ser Phe Val Leu Leu 


20 






25 






30 


Leu Leu Leu Cys 


Leu Leu Cys 


Phe 


He 


Phe 


Ser 


Tyr Met Gly Lys Asp 


35 




40 








45 


Leu Pro Lys Asn 


Tyr Asn Glu 


Ala 


Lys 


Ala 


He 


Thr Phe Cys Leu Leu 


50 


55 










60 


Leu Leu He Leu 


Thr Trp He 


He 


Phe 


Thr 


Thr 


Ala Ser Leu Leu Tyr 
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65 


70 






His Ser 


Leu 


85 






lie Tyr Ser Phe Leu 


Leu Trp 


Tyr 


100 






He Phe Gin Pro Gin 


Lys Asn 


Thr 


115 




120 


Gin Asp Tyr Thr Lys 


Thr He 


Ser 


130 


135 





<210> 13 

<211> 242 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial 
synthetic construct 



<400> 13 










Phe 


Ala 


Val 


Asn 


Tyr Asn 


Thr 


Pro 


1 








5 






Met 


Cys 


Phe 


Leu 


He Leu 


Gly 


Cys 








20 








pne 


pne 


Tyr 


Phe 


Glu Arg 


pro 


Thr 






35 








40 


Met 


Pro 


Phe 


Leu 


Leu Phe 


Tyr 


Ala 




50 








55 




Arg 


Ser 


Phe 


Gin 


He Val 


He 


He 


65 








70 






Arg 


Val 


His 


Ser 


Trp Trp 


Met 


Lys 










85 






Ser 


Met 


Thr 


Phe 


Val Leu 


Gin 


Ala 








100 








Ser 


Asn 


Pro 


Pro 


Leu Pro 


Tyr 


Xaa 






115 








120 


He 


He 


Leu Gly 


Cys Asp 


Val 


Asn 




130 








135 




Phe 


Leu 


Leu 


Leu 


Leu Leu 


Cys 


He 


145 








150 






Gly 


Lys 


Asp Leu 


Pro Lys 


Asn 


Tyr 










165 






Cys 


Leu 


Leu 


Leu 


Leu He 


Leu 


Thr 








180 








Met 


Leu 


Tyr His 


Gly Lys 


Tyr 


He 






195 








200 


Leu 


Ser 


Ser 


Ala 


Tyr Cys 


Phe 


Leu 




210 








215 




Tyr 


He 


He 


He 


Phe Gin 


Pro 


His 


225 








230 






Leu 


Ser 













<210> 14 
<211> 165 
<212> PRT 



75 






80 


Asn Ala Leu Ala Val 


Leu 


Ser 


Ser 


90 




95 




Phe Leu Pro Lys Cys 


Tyr 


He 


He 


105 


110 






Gin ijys Tyr Phe Gin 


Gly 


Leu 


He 



125 



Gin 



Sequence; note o 



val 


Val 

10 


Arg 


Ser 


Ala 


uiy 


15 




Leu 


Ser 


Leu 


Cys 


Ser 


He 


Ser 


Val 


25 










30 






Glu 


Ala 


Phe 


Cys 


He 
45 


Leu 


Arg 


Phe 


Val 


Cys 


Leu 


Ala 
60 


Cys 


Phe 


Ala 


Val 


Phe 


Lys 


He 
75 


Ala 


Ala 


Lys 


Phe 


Pro 
80 


Tyr 


His 
90 


Gly 


Gin 


Trp 


Leu 


Val 
95 


He 


Val 


Val 


He 


Val 


He 


Gly 


Phe 


Ser 


105 










110 






Xaa 


Phe 


Val 


Ser 


Tyr 
125 


Pro 


Asp 


Lys 


Leu 


Asn 


Met 


Ala 
140 


Ser 


Thr 


Ser 


Phe 


Leu 


Cys 


Phe 
155 


Thr 


Phe 


Ser 


Tyr 


Met 
160 


Asn 


Glu 
170 


Ala 


Lys 


Ala 


lie 


Thr 
175 


Phe 


Trp 


He 


He 


Phe 


Ala 


Thr 


Ala 


Phe 


185 










190 






His 


Thr 


Leu 


Asn 


Ala 
205 


Leu 


Ala 


Val 


Leu 


Trp 


Tyr 


Phe 
220 


Leu 


Pro 


Lys 


Cys 


Lys 


Asn 


Thr 


Gin 


Lys 


Tyr 


Phe 


Gin 



235 240 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial 
synthetic construct 



<400> 14 










Lys 


Lys 


Gin 


wxy jfro 




vai 


riSp 


X 






5. 








Leu 


Cys 


He 


ser val 
20 


Leu 


Gly 


vai 




Asp 


Leu 

JO 


Asp Fne 


Tyr 


Met 


Asp 
* u 


Thr 


Leu 
50 


Ser 


Pro Gly 


oer 


rne 
55 


lie 


Ser 


Val 


Leu 


Cys Phe 


Pne 


Pne 


Ser 


65 








70 






Asn 


Tyr 


Asn 


Glu Ala 
85 


Lys 


Cys 


Val 


He 


Ser 


Trp 


He Ser 
100 


Phe 


Phe 


Thr 


Phe 


Thr 


Val 
115 


Ala Ala 


Tyr 


Val 


Cys 
120 


Phe 


Phe 
130 


Gly 


Gly Tyr 


Phe 


Leu 
135 


Pro 


Pro 


Gin 


Met 


Asn Thr 


Thr 


Ala 


His 


145 








150 






Thr 


Met 


Ser 


Lys Gin 
165 









<210> 15 

<211> 236 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 



<400> 15 



Ala 


Pro 


Lys 


Ser 


Ser 


Gin 


Arg 


Xaa 


1 








5 








Glu 


Trp 


Asp 


His 


Pro 


Met 


Ser 


Val 








20 










Cys 


Leu 


Leu 


Met 


Thr 


Ser 


Ser 


Ser 






35 










40 


Asn 


Thr 


Pro 


Val 


Ala 


Lys 


Ser 


Ala 




50 










55 




Leu 


Ala 


Ala 


Leu 


Thr 


Ala 


Ala 


Ala 


65 










70 






Gin 


Pro 


Ser 


Pro 


Leu 


Ala 


Ser 


Lys 










85 








Ser 


Phe 


Thr 


Val 


Cys 


Leu 


Ala 


Cys 








100 










His 


Leu 


His 


Phe 


Xaa 


He 


Arg 


Val 






115 










120 


Trp 


Ala 


Lys 


Asn 


His 


Gly 


Pro 


Xaa 



Sequence; note = 



Tie Phe Tie 


Val 


Ser 


Val Thr He 


1 0 






15 


Ala Val Gly 


Pro 


Pro 


Glu Pro Ser 


?5 
*c 






30 


Ser Tie Val 


Leu 


Glu 


r*vs Ser Asn 






45 




Glu Leu Cys 


Tyr Val 


Cys Val Leu 




60 






Tyr Met Gly 


Lys Asp 


Leu Pro Ala 


75 






80 


Thr Phe Ser 


Leu 


Met 


Val Tyr Met 


90 






95 


Val Tyr Leu 


He 


Ser 


Arg Gly Pro 


105 






110 


Ala Thr Leu 


Val 


Ser 


Val Leu Ala 






125 




Lys. He Tyr 


He 


He 


Val Leu Lys 




140 






Phe Gin Asn 


Cys 


He 


Gin Met Tyr 


155 






160 



Sequence; note = ' 



Leu 


Arg 


Arg 


Thr 


Arg 


Leu Xaa Leu 




10 








15 


Ala 


Leu 


Leu 


Phe 


Phe 


Leu Val Cys 


25 










30 


Ala 


Val 


He 


Leu 


Leu 


Leu Asn He 










45 




Gly 


Gly 


Xaa 


Thr 


Cys 


Xaa Leu Lys 








60 






Met 


Ser 


Ser 


Xaa 


Cys 


His Phe Gly 






75 






80 


Leu 


Lys 


Gin 


Pro 


Gin 


Phe Thr Phe 




90 








95 


Asn 


Arg 


Cys 


Ala 


Leu 


Ala Thr Gly 


105 










110 


Ala 


Leu 


Pro 


Pro 


Ala 


Tyr Asn Xaa 










125 




Ala 


Thr 


He 


Phe 


He 


Ala Ser Ala 
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130 135 140 

Ala lie Leu Cys Val Leu Cys Leu Arg Val Ala Val Gly Pro Pro Gin 
145 150 155 160 

Pro Ser Gin Asx Leu Asx Phe Xaa Thr Asn Ser He Xaa Leu Xaa Xaa 

165 170 175 

Ser Asn Thr Leu Ser Pro Gly Ser Phe Val Glu Leu Cys Asn Val Ser 

180 185 190 

Leu Leu Ser Ala Val Cys Phe Val Phe Ser Xaa Met Gly Lys Asx Leu 

195 200 . 205 

Pro Ala Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Met Val 

210 215 220 

Asn Xaa He Ser Trp He Ser Phe Phe Thr Val Tyr 
225 230 235 

<210> 16 
<211> 838 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 16 



Met 


Gly 


Pro 


Arg 


Ala 


Lys - Thr 


He 


Cys 


Ser 


Leu 


Phe 


Phe 


Leu 


Leu 


Trp 


1 








rr 
D 










i n 

iU 










1 ^ 

13 




Val 


Leu 


Ala 


Glu 


Pro 


Ala 


Glu 


Asn 


Ser 


Asp 


Phe 


Tyr 


Leu 


Pro 


Gly 


Asp 








20 










25 










30 






Tyr 


Leu 


Leu 


Gly 


Gly 


Leu 


Phe 


Ser 


Leu 


His 


Ala 


Asn 


Met 


Lys 


Gly 


He 






35 










40 










45 








Val 


His 


Leu 


Asn 


Phe 


Leu 


Gin 


Val 


Pro 


Met 


Cys 


Lys 


Glu 


Tyr 


Glu 


Val 




50 










55 










60 










Lys 


Val 


lie 


Gly 


Tyr 


Asn 


Leu 


Met 


Gin 


Ala 


Met 


Arg 


Phe 


Ala 


Val 


Glu 


65 










70 










75 










80 


Glu 


He 


Asn 


Asn 


Asp 


Ser 


Ser 


Leu 


Leu 


Pro 


Gly 


Val 


Leu 


Leu 


Gly 


Tyr 










85 










90 










95 




Glu 


He 


val 


Asp 


Val 


Cys 


Tyr 


He 


Ser 


Asn 


Asn 


Val 


Gin 


Pro 


Val 


Leu 








100 










105 










110 






Tyr 


Phe 


Leu 


Ala 


His 


Glu 


Asp 


Asn 


Leu 


Leu 


Pro 


He 


Gin 


Glu 


Asp 


Tyr 






115 










120 










125 








Ser 


Asn 


Tyr 


He 


Ser 


Arg 


Val 


Val 


Ala 


Val 


He 


Gly 


Pro 


Asp 


Asn 


Ser 




130 










135 










140 










Glu 


Ser 


Val 


Met 


Thr 


Val Ala 


Asn 


Phe 


Leu 


Ser 


Leu 


Phe 


Leu 


Leu 


Pro 


145 










150 










155 










160 


Gin 


He 


Thr 


Tyr 


Ser 


Ala 


He 


Ser 


Asp 


Glu 


Leu 


Arg 


Asp 


Lys 


Val 


Arg 










165 










170 










175 




Phe 


Pro 


Ala 


Leu 


Leu 


Arg Thr 


Thr 


Pro 


Ser 


Ala 


Asp 


His 


His 


Val 


Glu 








180 










185 










190 






Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 


Phe 


Arg 


Trp 


Asn 


Trp 


He 


He 


Val 






195 










200 










205 








Leu 


Val 


Ser 


Ser 


Asp 


Thr Tyr 


Gly 


Arg 


Asp 


Asn 


Gly 


Gin 


Leu 


Leu 


Gly 




210 










215 










220 










Glu 


Arg 


Val 


Ala 


Arg 


Arg Asp 


He 


Cys 


He 


Ala 


Phe 


Gin 


Glu 


Thr 


Leu 


225 










230 










235 










240 


Pro 


Thr 


Leu 


Gin 


Pro 


Asn 


Gin 


Asn 


Met 


Thr 


Ser 


Glu 


Glu 


Arg 


Gin 


Arg 










245 










250 










255 




Leu 


Val 


Thr 


He 


Val 


Asp Lys 


Leu 


Gin 


Gin 


Ser 


Thr 


Ala 


Arg 


Val 


Val 
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260 


ZOO 


0 on 


vai 


vai 


DV»e» 


Car 

oer 


Fro Asp lieu inr Leu lyr his pne 


Fne Asn biu vai 






Z / D 




O Qn 
Z O U 




Leu 


Arg 


Gill 


Asn 


TiVi « •v- n1„ t\ T _ TT.-* T T>-*-'r-\ Tl « H T ^ 

pne xnr biy Aia vai irp lie Ala 


ser uiu oer irp 










Zyo oUO 




Ala 


He 


Asp 


Pro 


Val Leu His Asn Leu Thr Glu Leu 


Gly His Leu Gly 


305 








310 315 


320 




JrJQe 


Leu 


viiy 


lie inr lie bin oer vai rro lie 


Fro vjxy Fiic ocx 








Jzb JiU 


^ 0. c 


ti 


File 


Arg 


blU 


1 rp biy Fro bin Aid biy Fro Fro 


Fro iieu oer Axvg 








O *± u 






Thx 


C pk 7^ 
OCX 






Tvt* Thf r*vfl Ann Oln filii Pvr Art*» 

J. jr J- J- Hi- v—y 0 noil V3J.ll vjjl kl V* y D nop 


Ann r*VR Tien Asn 

noli v» jf 0 ucu nsu 












ICR 

Job 


Ala 


±nr 


Leu 


Ser 


Fne Asn rnr lie lieu Arg lieu oer 


biy biu Arg vai 




J / U 






3 f D O O U 




Val 


Tyr 


Ser 


vai 


ryr oer Aia vai ryr Aia vai Aia 


TT i „ TV T — T cm 17<i ft 

his Aia iteu his 


1 fl C 
Job 








■a on toe 

jyo jyb 


Ann 


Ser 


Leu 


Leu 


biy 


cys Asp Lys oer xnr cys inr -uys 


Arg vai vai Tyr 










41U 


4lb 


Pro 


Trp 


Villi 


Leu 


jjeu uiu uiu lie irp itys vai Asn 


Fne inr i»eu iieu 








AO O 


AO C 


a ^ n 


Asp 


TT J _ 

HIS 


Gin 


TT a 

lie 


pne pne Asp pro Gin Gly Asp val 


Ala Leu His Leu 










AAA 


AAC 

44b 


Glu 


He 


Val 


Gin 


Trp Gin Trp Asp Arg Ser Gin Asn 


Pro Phe Gin Ser 




y( r A 

4bU 








■ 


If - 1 

Val 


Ala 


Ser 


Tyr 


Tyr Pro Leu Gin Arg Gin Leu Lys 


Asn He Gin Asp 


465 








470 475 


480 


lie 


Ser 


Trp 


His 


Thr Val Asn Asn Thr He Pro Met 


Ser Met Cys Ser 










a q c /ton 
4ob 43U 


A O C 

49b 


Lys 


TV -*-<•<» 

Arg 


cys 


Cain 


Ser Gly Gin Lys Lys Lys Pro Val 


/Tin Tl TT J _ TT- 1 

Gly He His. Val 








c nn 
bUU 


e n C 


biu 


Cys 


Cys 


pne 


blU 


cys lie Asp cys iieu Fro oiy inr 


Phe Leu Asn His 






3 ID 




con 


DZ b 


Tnr 




Asp 


VilU 


flit » v" Tit /^t! f» Hi w TV T /**• r m v*^x T\ hm 

iyr biu cys bin Aia cyB Fro Asn 


Asn biu irp oer 




can 
3Jv 










Tyr 




Ser 


flu 
blU 


inr oer cys Fne iiys Arg bin ijeu 


vai Fne Leu biu 


b45 








C C ft err 

bbU bbb 


C tZ ft 

boU 


HIS 


QlU 


vai 


Pro 


xnr lie vai vai Aia lie lieu Aia 


Ala Leu Gly Phe 










C C C CO ft 

bob b /U 


c 0 c 

b/b 


Pne 


Ser 


Thr 


Leu 


Ala lie Leu Phe lie Phe Trp Arg 


tt J _ T^V»'j^ /*ll m mL, 

His Pne Gin Tnr 








O 0 u 


toe 

bob 


con 

by u 


Pro 


Met 


vai 


TV 

Arg 


ser Aia Gly. uiy Pro wet cys pne 


T a«i A/TaW T m«« TT-1 

Leu wet Leu vai . 






C Q C 




ton 
DUU 


oUb 


Pro 


Leu 


Leu 


T AT* 

Leu 


Ala pne Gly Met vai pro vai Tyr 


vai Gly Pro pro 




ci ft 
610 






bib ozU 




Thr 


Val 


Pne 


Ser 


Cys Pne Cys Arg Gin Ala Pne Pne 


Tnr val Cys Phe 


625 








630 635 


640 


Ser 


He 


Cys 


Leu 


Ser Cys He Thr Val Arg Ser Phe 


Gin lie Val Cys 










CAE fiKA 


tec 


Val 


Phe 


Lys 


Met 


Ala Arg Arg Leu Pro Ser Ala Tyr 


Ser Phe Trp Met 








660 


665 


670 


Arg 


Tyr 


His 


Gly 


Pro Tyr Val Phe Val Ala Phe He 


Thr Ala He Lys 






675 




680 


685 


Val 


Ala 


Leu 


val 


Val Gly Asn Met Leu Ala Thr Thr 


He Asn Pro He 




690 






695 700 




Gly 


Arg 


Thr 


Asp 


Pro Asp Asp Pro Asn He Met lie 


Leu Ser Cys His 
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705 








710 










715 








720 


Pt*o Ann 

XT 1 W noil 


T T\ry 
xyx. 


Arg Asn 


Glv 
uiy 


Leu 


Leu 


Phe 


Asn 


Thr 


Ser 


Met 


Asn Leu 


Leu 








725 










730 








735 






Val 


Leu Gly 


Phe 


Ser 


Phe 


Ala 


Tvr 


Met 


Gly Lys 


Glu Leu 


Pro 






740 










745 










750 




Thr A flii 


Tvr 
lyi 


Asn 


Glu 


Ala 


Lys 


Phe 


lie 


Thr 


Leu 


Ser 


Met 


Thr Phe 


Ser 




755 










760 










765 






Phe Thr 


Ser 


Ser 


He 


Ser 


Leu Cys 


Thr 


Phe 


Met 


Ser 


Val 


His Asp 


Glv 


770 










775 










780 








Val Leu 


Val 


Thr 


He 


Met 


Asp Leu 


Leu 


Val 


Thr 


Val 


Leu 


Asn Phe 


Leu 


785 








790 










795 








800 


Ala lie 


Gly 


Leu Gly 


Tvr 


Phe Gly 


Pro 


Lvs 


Cys 


Tyr Met 


He Leu 


Phe 








805 










810 








815 




Tyr Pro 


Glu 


Arg Asn 


Thr 


Ser Ala 


Tvr 


Phe 


Asn 


Ser 


Met 


He Gin 


Glv 






820 










825 










830 




Tyr Thr 


Met 


Arg Lys 


Ser 






















835 


























<210> 17 


























<211> 844 


























<212> PRT 




i 






















<213> Artificial Sequence 


















<220> 




























<223> Description of Artificial 


Sequence; note « 






synthetic 


construct 


















<400> 17 


























Met Gly 


Pro 


Gin 


Ala 


Arg 


Thr 


Leu 


Cys 


Leu 


Leu 


Ser 


Leu 


Leu Leu 


His 


1 






5 










10 








15 




Val Leu 


Pro 


Lys 


Pro 


Gly 


Lys 


Leu 


Val 


Glu 


Asn 


Ser 


Asp 


Phe His 

f^l~lG 111 0 


Leu 






20 










25 










30 




Ala Gly 


Asp 


Tyr 


Leu 


Leu 


Gly 


Gly 


Leu 


Phe 


Thr 


Leu 


His 


A 1 =j Jinn 

411 CL noil 


V CXI 




35 










40 










45 






Lys Ser 


lie 


Ser 


His 


Leu 


Ser 


Tyr 


Leu 


Gin 


Val 


Pro 


Lys 


CjfD nb il 




50 










55 










60 








Phe Thr 


Met 


Lys 


Val 


Leu 


Gly 


Tyr 


Asn 


Leu 


Met 


Gin 


Ala 




rlic 


65 








70 










75 








80 


Ala val 


Glu 


Glu 


He 


Asn 


Asn 


Cys 


Ser 


Ser 


Leu 


Leu 


Pro 


fllv Val 
vjiy val 


JJCU 








85 










90 












Leu Gly 


Tyr 


Glu 


Met 


Val 


Asp 


Val 


Cys 


Tyr 


Leu 


Ser 


Asn 


AOll 11C 


His 






100 










105 










110 




Pro Gly 


Leu 


Tyr 


Phe 


Leu. 


Ala 


Gin 


Asp 


Asp 


Asp 


Leu 


Leu 


Pro He 


Leu 




115 










120 










125 






Lys Asp 


Tyr 


Ser 


Gin 


Tyr 


Met 


Pro 


His 


Val 


Val 


Ala 


Val 


Tip filv 

■Lie vjiy 




130 










135 










140 








Asp Asn 


Ser 


Glu 


Ser 


Ala 


He 


Thr 


Val 


Ser 


Asn 


He 


Leu 


Ser* His 
uci hid 


Phe 
rue 


145 








150 










155 








160 


Leu lie 


Pro 


Gin 


He 


Thr 


Tyr 


Ser 


Ala 


He 


Ser 


Asp 


Lys 


Lieu At*ct 

ucu xai y 


Asp 








165 










170 








175 




Lys Arg 


His 


Phe 


Pro 


Ser 


Met 


Leu 


Arg 


Thr 


Val 


Pro 


Ser 


Ala Thr 


His 




180 










185 










190 




His He 


Glu 


Ala 


Met 


Val 


Gin 


Leu 


Met 


Val 


His 


Phe 


Gin 


Trp Asn 


Trp 




195 










200 










205 






He Val 


Val 


Leu 


Val 


Ser 


Asp 


Asp 


Asp 


Tyr 


Gly 


Arg 


Glu 


Asn Ser 


His 


210 










215 










220 








Leu Leu 


Ser 


Gin 


Arg 


Leu 


Thr 


Lys 


Thr 


Ser 


Asp 


He 


Cys 


He Ala 


Phe 
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225 










230 






Gin 


Glu 


Val 


Leu 


Pro 
245 


He 


Pro 


Glu 


Glu 


Gin 


Arg 


Gin 
260 


Leu 


Asp 


Asn 


He 


Ala 


Arg 


Val 
275 


Val 


Val 


Val 


Phe 


Ser 
280 


Phe 


His 
290 


Glu 


Val 


Leu 


Arg 


Trp 
295 


Asn 


Ser 


Glu 


Ser 


Tro 


Ala 


He 


Asp 


Pro 


305 










310 






Arc? 


His 


Thr 


Glv 


Thr 
325 


Phe 


Leu 


Gly 


Pro 


Glv 


Phe 


Ser 
340 


Gin 


Phe 


Arcr 


Val 


Val 


Pro 


Asn 
355 


Thr 


Thr 


Asn 


Leu 


Arcr 
360 


Ala 


Cys 
370 


Leu 


Asn 


Thr 


Thr 


Lvs 
375 


Ser 


Glv 


Glu 


Area 


Val 


Val 


Tvr 


Ser 


Val 


385 










390 






His 


Ala 


Leu 


His 


Arcr 
405 


Leu 


Leu 


Gly 


Gin 


Lys 


Val 


Tyr 
420 


Pro 


Trp 


Gin 


Leu 


Phe 


Thr 


Leu 
435 


Leu 


Gly 


Asn 


Arg 


Leu 
440 


Pro 


Met 
450 


Leu 


Leu 


ASD 


He 


He 
455 


Gin 


Pro 


Phe 


Gin 


Ser 


He 


Ala 


Ser 


Tyr 


465 










470 






Tvr 


He 


Asn 


Asn 


Val 
485 


Ser 


Trp 


Tyr 


Ser 


Met 


Cvs 


Ser 
500 


Lys 


Ser 


Cys 


Gin 


Glv 


Leu 


His 
515 


Pro 


Cvs 


Cvs 


Phe 


Glu 
520 


Tvr 


Leu 
530 


Asn 


ArQ 


Ser 


Ala 


Asp 
535 


Glu ; 


Ser 


Met 


Tro 


Ser 


Tvr 


Lvs 


Asn 


Asp 


545 










550 






Thr 


Phe 


Leu 


Glu 


Trp 
565 


Trp 


His 


Glu 


Leu 


Ala 


Ala 


Leu 
580 


Gly 


Phe 


Leu 


Ser 


Tro 


Arcr 


His 
595 


Phe 


Gin 


Thr 


Pro 


He 
600 


Cvs 


Phe 
610 


Leu 


Met 


Leu 


Thr 


Leu 
615 


Leu 


Val 


Tvr 


Val 


Glv 


Pro 


Pro 


Lvs 


Val 


625 










630 






Leu 


Phe 


Pro 


Leu 


Cys 
645 


Phe 


Thr 


He 


Ser 


Phe 


Gin 


He 
660 


Val 


Cys 


Ala 


Phe 


Ala 


Tyr 


Ser 
675 


Tyr 


Trp 


Val 


Arg 


Tyr 
680 



235 240 



Ser 


Ser 


Gin Val Met 


Arg 


Ser 


Glu 




250 








255 




Leu 


Asp 


Lys Leu Arg 


Arg 


Thr 


Ser 


265 








270 






Pro 


Glu 


Leu 


Ser Leu 


Tyr 


Ser 


Phe 








285 








Phe 


Thr 


Gly Phe Val 


Trp 


He 


Ala 








300 








Val 


Leu 


His 


Asn Leu 


Thr 


Glu 


Leu 






315 








320 


Val 


Thr 


He 


Gin Arg 


Val 


Ser 


He 




330 








335 




Arg 


Arg 


Asd 


Lys Pro 


Gly 


Tyr 


Pro 


345 








350 






Thr 


Thr 


Cvs 

v- jr a 


Asn Gin 


Asp 


Cys 


Asp 








365 








Phe 


Asn 


Asn 


lie Leu 


He 


Leu 


Ser 








380 








Tyr 


Ser 


Ala 


Val Tyr 


Ala 


Val 


Ala 






395 








400 


Cys 


Asn 


Arc? 


Val Arg 


Cys 


Thr 


Lys 




410 








415 




Leu. 


Arg 


Glu 


He Trp 


His 


Val 


Asn 


425 








430 






Phe 


Phe 


Asd 


Gin Gin 


Gly 


Asp 


Met 








445 








Trp 


Gin 


Tro 


Asp Leu 


Ser 


Gin 


Asn 








460 








Ser 


Pro 


Thr 


Seir Lvs 


Arg 


Leu 


Thr 






475 








480 


Thr 


Pro 


Asn 


Asn Thr 


Val 


Pro 


Val 




490 








495 




Pro 


Gly 


Gin 


Met Lvs 


Lys 


Ser 


Val 


505 








510 






Cys 


Leu 


Asp 


Cys Met 


Pro 


Gly 


Thr 








525 








Phe 


Asn 


Cys 


Leu Ser 


Cys 


Pro 


Gly 








540 








He 


Thr 


Cys 


Phe Gin 


Arg 


Arg 


Pro 






555 








560 


Ala 


Pro 


Thr 


He Ala 


Val 


Ala 


Leu 




570 








575 




Thr 


Leu 


Ala 


He Leu 


Val 


He 


Phe 


585 








590 






Val 


Arg 


Ser 


Ala Gly 


Gly 


Pro 


Met 








605 








Leu 


Val 


Ala 


Tyr Met 


Val 


Val 


Pro 








620 








Ser 


Thr 


Cys 


Leu Cys 


Arg 


Gin 


Ala 






635 








640 


Cys 


He 


Ser 


Cys He 


Ala 


Val 


Arg 




650 








655 




Lys 


Met 


Ala 


Ser Arg 


Phe 


Pro 


Arg 


665 








670 






Gin 


Gly 


Pro 


Tyr Val 


Ser 


Met 


Ala 



685 
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Phe 


He 


Thr 


Val 


Leu 


Lys 


Met 


Val 


He 


Val 


Val 


He Gly Met Leu Ala 




690 










695 










700 


Thr 


Gly 


Leu 


Ser 


Pro 


Thr 


Thr 


Arg 


Thr 


Asp 


Pro 


Asp Asp Pro Lys He 


705 










710 










715 


720 


Thr 


He 


Val 


Ser 


Cys 


Asn 


Pro 


Asn 


Tyr 


Arg 


Asn 


Ser Leu Leu Phe Asn 










725 










730 




735 


Thr 


Ser 


Leu 


Asp 


Leu 


Leu 


Leu 


Ser 


Val 


Val 


Gly 


Phe Ser Phe Ala Tyr 








740 










745 






750 


Met 


Glv 


Lvb 


Glu 


Leu 


Pro 


Thr 


Asn 


Tyr 


Asn 


Glu 


Ala Lys Phe He Thr 






755 










760 








765 


Leu 


Ser 


Met 


Thr 


Phe 


Tvx 


Phe 


Thr 


Ser 


Ser 


Val 


Ser Leu Cys Thr Phe 




770 










775 










780 


Met 


Ser 


Ala 


Tyr 


Ser 


Gly 


Val 


Leu 


Val 


Thr 


He 


Val Asp Leu Leu Val 


785 










790 










795 


800 


Thr 


Val 


Leu 


Asn 


Leu 


Leu 


Ala 


He 


Ser 


Leu 


Gly 


Tyr Phe Gly Pro Lys 










805- 










810 




815 


Cys 


Tyr 


Met 


He 


Leu 


Phe 


Tyr 


Pro 


Glu 


Arg 


Asn 


Thr Pro Ala Tyr Phe 








820 










825 






830 


Asn 


Ser 


Met 


He 


Gin 


Gly 


Tyr 


Thr 


Met 


Arg 


Arg 


Asp 



835 840 



<210> 18 
<211> 855 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 



<400> 18 


















Met 


Leu 


Gly 


Pro 


Ala 


Val 


Leu 


Gly 


Leu 


Ser 


Leu Trp Ala Leu Leu His 


1 








5 










10 


15 


Pro 


Gly 


Thr 


Gly 


Ala 


Pro 


Leu 


Cys 


Leu 


Ser 


Gin Gin Leu Arg Met Lys 








20 










25 




30 


Gly 


Asp 


Tyr 


Val 


Leu 


Gly 


Gly 


Leu 


Phe 


Pro 


Leu Gly Glu Ala Glu Glu 






35 










40 






45 


Ala 


Gly 


Leu 


Arg 


Ser 


Arg 


Thr 


Arg 


Pro 


Ser 


Ser Pro Val Cys Thr Arg 




50 










55 








60 


Phe 


Ser 


Ser 


Asn 


Gly 


Leu 


Leu 


Trp 


Ala 


Leii 


Ala Met Lys Met Ala Val 


65 










70 










75 80 


Glu 


Glu 


He 


Asn 


Asn 


Lys 


Ser 


Asp 


Leu 


Leu 


Pro Gly Leu Arg Leu Gly 










85 










90 


95 


Tyr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cys 


Ser 


Glu 


Pro 


Val Val Ala Met Lys Pro 








100 










105 




110 


Ser 


Leu 


Met 


Phe 


Leu 


Ala 


Lys 


Ala 


Gly 


Ser 


Arg Asp He Ala Ala Tyr 






115 










120 






125 


Cys 


Asn 


Tyr 


Thr 


Gin 


Tyr 


Gin 


Pro 


Arg 


Val 


Leu Ala Val He Gly Pro 




130 










135 








140 


His 


Ser 


Ser 


Glu 


Leu 


Ala 


Met 


Val 


Thr 


Gly 


Lys Phe Phe Ser Phe Phe 


145 










150 










155 160 


Leu 


Met 


Pro 


Gin 


Val 


Ser 


Tyr 


Gly 


Ala 


Ser 


Met Glu Leu Leu Ser Ala 










165 










170 


175 


Arg 


Glu 


Thr 


Phe 


Pro 


Ser 


Phe 


Phe 


Arg 


Thr 


Val Pro Ser Asp Arg Val 






180 










185 




190 
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Gin 


Leu 


Thr 


TV *1 mm 

Ala 


Ala 


Ala 


Glu 


Leu 


Leu 


Gin 


GlU 


Pne 


/"ll Tr 

Giy 


Trp 


TV err 

Asn 


Trp 






195 










O A A 

200 










2 05 








Val 


TV 1 — 

Ala 


Ala 


Leu 


Gly 


Ser 


Asp 


Asp 


CjIU 


Tyr 


o±y 


Arg 


uxn 




Leu 


Ser 




210 










215 










O. *5 A 

220 










lie 


Phe 


Ser 


Ala 


Leu 


Ala 


TV 1 mm 

Ala 


Ala. 


Arg 


Gly 


lie 


cys 


Tl a 

lie 


TV T — 

Aia 


TT £ — 

rllB 


rO tt 
GlU 


225 










230 










ml JD 










o a n 


Gly 


Leu 


Val 


Pro 


Leu 


Pro 


Arg 


TV T — . 

Ala 


Asp 


71 

Asp 


O A W 

ser 


Arg 


T All 

Leu 


Giy 


iiys 


vai 










245 










OCA 

250 










ore 

2bb 




Gin 


Asp 


Val 


Leu 


His 


Gin 


Val 


Asn 


Gin 


Ser 


Ser 


TT— TJ 

val 


Gin 


vai 


TT— TJ 

vai 


Leu 






260 










*■» LZ c 

265 










2 /O 






Leu 


Phe 


Ala 


Ser 


Val 


His 


71 1 mm 

Ala 


Ala 


His 


7Vl a 

Ala 


Leu 


pne 


TV a*i 

Asn 


Tyr 


Ser 


Tl A. 

lie 






one 

275 










O A 

280 










n o c 

2 ob 








Ser 


Ser 


Arg 


Leu 


Ser 


Pro 


Lys 


ir. Tj 

Val 


Trp 


TT- T 

vai 


TV 1 

Aia 


Ser 


GlU 


TV 1 -j 

Aia 


Trp 


Leu 




O O A 
290 










2 73 










inn 










Thr 


Ser 


Asp 


Leu 


Val 


Met 


Gly 


Leu 


Pro 


tjiy 


Met 


TVl =» 

Aia 


uin 


wee 


iaiy 


Tnr 


O A C 

J Ob 










Ol ft 

J1U 










J 19 










ion 


Val 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Gly 


7\ "1 — 

Ala 


Gin 


Leu 


TT J — 

HIS 


GlU 


pne 


Pro 


/IT *T 

Gin 










325 










OTA 

330 










*3 "5 C 

335 




Tyr 


Val 


Lys 


Thr 


His 


Leu 


Ala 


Leu 


Ala 


Thr 


Asp 


Pro 


TV 1 — 

Ala 


Pne 


Cys 


Ser 








340 










345 










350 






Ala 


Leu 


Gly 


Glu 


Arg 


Glu 


Gin 


Gly 


Leu 


Glu 


Glu 


Asp 


Val 


Val 


Gly 


Gin 






355 










360 










365 








Arg 


Cys 


Pro 


Gin 


Cys 


Asp 


Cys 


He 


Thr 


Leu 


Gin 


Asn 


Val 


Ser 


Ala 


Gly 




370 










375 










380 










Leu 


Asn 


His 


His 


Gin 


Thr 


Phe 


Ser 


Val 


Tyr 


Ala 


Ala 


Val 


Tyr 


Ser 


Val 


385 










390 
















• 




A A A 

400 


Ala 


Gin 


Ala 


Leu 


His 


Asn 


Thr 


Leu 


Gin 


Cys 


Asn 


TV "1 _ 

Ala 


Ser 


Gly 


Cys 


Pro 










405 










/I ft 

410 










41b 




Ala 


Gin 


Asp 


Pro 


Val 


Lys 


Pro 


Trp 


mm. 

Gin 


Leu 


Leu 


Glu 


Asn 


Met 


Tyr 


Asn 








420 










425 










a *y a 
43 0 






Leu 


Thr 


Phe 


His 


Val 


Gly 


Gly 


Leu 


Pro 


Leu 


Arg 


Pne 


Asp 


Ser 


Ser 


Gly 






435 










A A A 

440 










A A C 

44b 








Asn 


Val 


Asp 


Met 


Glu 


Tyr 


Asp 


Leu 


Lys 


Leu 


Trp 


Val 


Trp 


/TJ TJ 

Gin 


Gly 


Ser 




450 










/ICC 

455 










A £ A 










Val 


Pro 


Arg 


Leu 


His 


Asp 


«»« tj 

Val 


Gly 


Arg 


Pne 


Asn 


Gly 


Daw* 

ser 


T Al« 

Leu 


TV mm 

Arg 


Thr 


465 










A 1 f\ 

470 










*k /b 










a p n 
4 o o 


Glu 


Arg 


Leu 


Lys 


lie 


Arg 


Trp 


HIS 


Thr 


Ser 


TV a -a 

Asp 


TV MM 

Asn 


Gin 


Lys 


Pro 


TT- 1 

vai 










A Q C 

4 85 










A Q A 

4 y U 










4yb 




Ser 


Arg 


Cys 


Ser 


Arg 


Gin 


Cys 


Gin 


Glu 


Gly 


Gin 


Val 


Arg 


Arg 


tr. tj 

Val 


Lys 








500 










505 










510 






Gly 


Phe 


His 


Ser 


Cys 


Cys 


Tyr 


Asp 


Cys 


Val 


Asp 


Cys 


Glu 


Ala 


Gly 


Ser 






515 










520 










525 








Tyr 


Arg 


Gin 


Asn 


Pro 


Asp 


Asp 


He 


Ala 


Cys 


Thr 


Phe 


Cys 


Gly 


Gin 


Asp 




530 










535 










540 










Glu 


Trp 


Ser 


Pro 


Glu 


Arg 


Ser 


Thr 


Arg 


Cys 


Phe 


Arg 


Arg 


Arg 


Ser 


Arg 


545 










550 










c c c 

555 










boQ 


Phe 


Leu 


Glu 


Leu 


Ala 


Trp 


Gly 


Glu 


Pro 


Ala 


val 


Leu 


Ser 


T a*« 

Leu 


Leu 


T A1» 

Leu 










565 










Cflft 

570 










575 




Leu 


Leu 


Cys 


Leu 


Val 


Leu 


Gly 


Leu 


Thr 


Leu 


Ala 


Ala 


T A«T 

Leu 


Gly 


T All 

Leu 


TV],- 

pne 








con 
bo U 










cor 
bob 










con 






Val 


His 


Tyr 


Trp 


Asp 


Ser 


Pro 


Leu 


val 


Gin 


Ala 


Ser 


Gly 


Gly 


Ser 


Leu 






595 










600 










605 








Phe 


Cys 


Phe 


Gly 


Leu 


He 


Cys 


Leu 


Gly 


Leu 


Phe 


Cys 


Leu 


Ser 


Val 


Leu 




610 










615 










620 










Leu 


Phe 


Pro 


Gly 


Arg 


Pro 


Arg 


Ser 


Ala 


Ser 


Cys 


Leu 


Ala 


Gin 


Gin 


Pro 


625 










630 










635 










640 


Met 


Ala 


His 


Leu 


Pro 


Leu 


Thr 


Gly 


Cys 


Leu 


Ser 


Thr 


Leu 


Phe 


Leu 


Gin 
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645 



It I Q 

A J. a 


Ala 


G1U 


Tl a nV|Q T7-, l 

lie riie vai 


uJLU 


Cor 








660 






Trp 


Leu 


Cys 


Ser Tyr Leu 


Arg 


oiy 






675 






O O U 


Leu 


Ala 


Thr 


Leu Val Glu 


Ala 


iUd 




690 










pne 


Pro 


Pro 


(jrlU Vai Val 


rpV, v 


Hap 


t n c 

705 






710 






Leu 


UlU 


HIS 


Cys Arg Met 


Arcf 


Ser 








725 . 






He 


Thr 


Asn 


Ala Val Leu 


111 n 

Aia 


Fne 








740 






Val 


Gin 


Ser 


Gin Pro Gly 


Arg 


Tyr 






755 






760 


Ala 


Met 


Leu 


Ala Tyr Pne 


Tl- 
116 


Tie* 

lie 




770 






775 




Ala 


Asn 


Val 


Gin Val Ala 


Tyr 


Gin 


785 






790 






Leu 


Phe 


cys 


Ala Leu Gly 


He 


Leu 








805 






Tyr 


Val 


Leu 


Leu Trp Leu 


Pro 


Glu 








820 






Gly 


Arg 


Ser 


Pro Lys Glu 


Ala 


Ser 






835 




» 


840 


Ala 


Thr 


Arg 


Gly His Ser« Glu 






850 






855 





<210> 19 
<211> 859 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 



<400> 19 



Met 


Pro 


Gly 


Leu 


Ala 


He 


Leu 


Gly 


1 








5 








Leu 


Gly 


Met 


Gly 


Ser 


Ser 


Leu 


Cys 








20 










Gly 


Asp 


Tyr 


lie 


Leu 


Gly 


Gly 


Leu 






35 










40 


Ala 


Thr 


Leu 


Asn 


Gin 


Arg 


Thr 


Gin 




50 










55 




Phe 


Ser 


Pro 


Leu 


Gly 


Leu 


Phe 


Leu 


65 










70 






Glu 


Glu 


lie 


Asn 


Asn 


Gly 


Ser 


Ala 










85 








Tyr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cys 


Ser 








100 










Ser 


Leu 


Met 


Phe 


Met 


Ala 


Lys 


Val 






115 










120 


Cys 


Asn 


Tyr 


Thr 


Gin 


Tyr 


Gin 


Pro 




130 










135 




His 


Ser 


Ser 


Glu 


Leu 


Ala 


Leu 


He 





650 










c c c 
ODb 




Glu 


Leu 


Pro 


Leu 


Ser 


Trp 


7Vl ~> 

Aia 


Asn 


665 










o7 0 






Pro 


Trp 


Ala 


Trp 


Leu 


vai 


vai 


Leu 


















lieu 


Cys 


Ala 


Trp 


Tyr 


Leu 


wee 


A 1 fa 

Aia 








/ uu 










Trp 


Gin 


Val 


Leu 


Pro 


inr 


vjIU 


vai 






•71 K 












Trp 


vai 


Ser 


Leu 


Giy 


Leu 


vai 


illS 




H 1 A 
/JO 














Leu 


Cys 


Pne 


Leu 


Giy 


Tnr 


Phe 


Leu 


745 










750 






Asn 


Arg 


Ala 


Arg 


Gly 


Leu 


Thr 


Phe 










765 








Trp 


Val 


Ser 


Phe 


Val 


Pro 


Leu 


Leu 








7o0 










Pro 


Ala 


Val 


Gin 


Met 


Gly 


Ala 


He 






795 










800 


Ala 


Thr 


Phe 


His 


Leu 


Pro 


Lys 


Cys 




810 










815 




Leu 


Asn 


Thr 


Gin 


Glu 


Phe 


Phe 


Leu 


825 










830 






Asp 


Gly 


Asn 


Ser 


Gly 


Ser 


Ser 


Glu 



845 



Sequence; note » 



Leu 


Ser 
10 


Leu 


Ala 


Ala 


Phe 


Leu 
15 


Glu 


Leu 


Ser 


Gin 


Gin 


Phe 


Lys 


Ala 


Gin 


25 










30 






Phe 


Pro 


Leu 


Gly 


Thr 
45 


Thr 


Glu 


Glu 


Pro 


Asn 


Gly 


He 
60 


Leu 


Cys 


Thr 


Arg 


Ala 


Met 


Ala 
75 


Met 


Lys 


Met 


Ala 


Val 
80 


Leu 


Leu 
90 


Pro 


Gly 


Leu 


Arg 


Leu 
95 


Gly 


Glu 


Pro 


Val 


Val 


Thr 


Met 


Lys 


Pro 


105 










110 






Gly 


Ser 


Gin 


Ser 


He 
125 


Ala 


Ala 


Tyr 


Arg 


Val 


Leu 


Ala 
140 


Val 


He 


Gly 


Pro 


Thr 


Gly 


Lys 


Phe 


Phe 


Ser 


Phe 


Phe 
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145 










150 






Leu 


met. 


xrxO 


u±n 


Val 
xoo 


Cor* 


xyr 


Ser 


Arg 


-LU 


±11x7 


trie 
180 


Pro 


OCX 


file 


XT IXC 


Gin 


Leu 


uin 
195 


Aia 


vai 


vai 


TV-it* 
1X117 


T.aii 

JjcU 

200 


Val 


Ala 

210 


Ala 


Leu 


Gly 


0 A V 

ser 


Asp 
215 


Asp 


lie 


Phe 


Ser 


Gly 


Leu 


Ala 


Asn 


ber 


225 










230 






Gly 


Leu 


Val 


Pro 


Gin 

245 


HIS 


TV ny\ 

ASp 


Tnr 


Val 


Asp 


Val 


Leu 
260 


Arg 


Gin 


vai 


Asn 


Leu 


Phe 


TV 1 a 

Ala 
275 


Ser 


TV T -» 

Ala 


TV •*•/•* 

Arg 


TV T a 

Aia 


\T f* 1 

vai 
280 


Leu 


HIS 
290 


TV mv* 

Asp 


Leu 


Ser 


Pro 


Lys 
295 


Val 


Thr 


Ser 


Asp 


Leu 


vai 


Mat* 

Met 


Thr 


Leu 


305 










310 






Val 


Leu 


Gly 


pne 


Leu 

IOC 


Cjin 


Arg 


Giy 


Tyr 


val 


G1U 


Thr 
340 


TV -W-/~# 

Arg 


Leu 


TV 1 o 

Aia 


Leu 


Ser 


Leu 


Lys 
355 


TV 1 a 

Ala 


Q1U 


Leu 


ASp 


Leu 
360 


Cys 


Ser 
370 


Gin 


Cys 


Asp 


Tyr 


lie 
375 


ll.li 

Met 


Met 


Gin 


Asn 


Leu 


Ser 


Ala 


Gly 


Gin 


385 










390 






Tyr 


Ala 


Ala 


Val 


Tyr 
405 


Ser 


Val 


Ala 


Cys 


Asn 


Val 


Ser 
420 


His 


Cys 


His 


Thr 


Leu 


Leu 


Glu 
435 


Asn 


Met 


Tyr 


Asn 


Met 
440 


Leu 


Gin 
450 


Pne 


Asp 


Aia 


T irn 

Lys 


Giy 
455 


ber 


Met 


Trp 


Val 


Trp 


Gin 


Ser 


Pro 


Tnr 


465 










470 






Phe 


Asn 


Gly 


Thr 


T /Ml 

Leu 

APS 


Gin 


T All 

Leu 


Gin 


Asn 


Gin 


Val 


Pro 
500 


vai 


Ser 


Gin 


Cys 


Val 


Arg 


TV <W*M 

Arg 
515 


TT n -| 

val 


Lys 


Giy 


T3Vv a 

pne 


TT ■« Q 

xllS 
520 


Cys 


Lys 
530 


TV T m 

Ala 


Gly 


Ser 


Tyr 


Arg 
535 


Lys 


Pro 


Cys 


Gly 


Lys 


Asp 


Gin 


Trp 


Ser 


545 










550 






Pro 


Arg 


Arg 


Pro 


Lys 
565 


Phe 


Leu 


Glu 


Leu 


Leu 


Leu 


Leu 

580 


Leu 


Leu 


Leu 


Ser 


Ala 


Leu 


Gly 
595 


Leu 


Phe 


Val 


His 


His 
600 







155 








160 


Ala 


Ser 


Met Asp 


Arcr 


Leu 


Ser 


Asp 




170 








175 






Thr 


Val Pro 


Ser 


Asp 


Ara 


Val 


185 








190 






Leu 


Gin 


Asn Phe 


Ser 


Tro 


Asn 


Trt> 








205 








Asp 


x.yx 


Glv Ara 


Glu 


Glv 


Leu 


Ser 






220 












VSTXjr 


Tip fSro 
lie uy d 


Tie 

X X c 


AT a 


His 


Glu 






235 








240 


OCX 


Glv 


Gin Gin 

wXXX VJXli 


XJCIX 


Glv 


Lys 


Val 




250 








255 




Gin 


Ser 


Lys Val 


Gin 


Val 


Val 


Val 


265 








270 






Tvr 


OCX 


i-ICLX XT 


Ser 


xyx 


Ser 


lie 








285 








irp 


Val 

VAX 


Al a ff ey 
rvx ct ocx 


Glu 


Ser 




Leu 






300 










Pro 


Ash 


lie Ala 


At a 


Val 


Glv 


Thr 






315 








320 


A 1 a 

nXa 


Leu 


xjc ix rxu 


Glu 


Phe 


Ser 


His 




330 








335 




AT a 


Ala 


Asp Pro 


Thr 


Phe 


Cys 


Ala 


14 R 
j *± j 








350 






Ol 11 

w_L IX 


Glu 

UXU 


Arg Val 


Met 


Glv 
uxy 


Pro 


Ara 








365 








T,pn 


fll n 

V7XXX 


ADXX XJCLX 


Ser 


Ser 


Glv 
uxy 


Leu 






380 










• 

Leu 


XllS 


it-! a Hi n 

XI X O V3XXX 


xxc 


Phe 
XT xxc 


Ala 

nla 


xxxx 














400 


Gin 


Aia 


Leu His 


Asn 


Ixxx 


L6U 


UlXX 




41U 












ser 


G1U 


fro vai 




Pro 


Trp 


m n 


v« j 








430 






Cor 

OCX 


xrxxc 


*iiy nla 




A en 
nop 


XJC IX 


Thr 








445 










TA oy"n 


Met Glu 




Asp 


Leu 


TjVS 
xxy s 






460 










xrxVO 


Val 
vai 


XICIX XIX o 


xxxx 


Val 

VAX 


Glv 

VJX Jf 


Thr 






475 








480 


Wis 


Ser 


LvB Met 


xyx 


J.X kJ 


Pro 


Glv 












495 




OCX 


Axy 


vjj.ii v*yb 


Lys 


Ann 


Glv 


Gin 


505 








510 






OCX. 




v» jr D x jr x 


Asp 


Cys 


Val 


Asp 








525 








His 


Pro 


Asn Asn 


Phe 


Thr 


Cys 


Thr 






540 










Pro 


Glu 


Lys Ser 


Thr 


Thr 


Cys 


Leu 






555 








560 


Leu 


Ala 


Trp Gly 


Glu 


Pro 


Ala 


Val 




570 








575 




Leu 


Ala 


Leu Gly 


Leu 


Val 


Leu 


Ala 


585 








590 






Arg 


Asp 


Ser Pro 


Leu 


Val 


Gin 


Ala 



605 
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Ser 


Glv 


Gly Pro Leu Ala 


Cvs 


Phe 




610 






615 




Cvs 


Leu. 


Ser Val 


Leu Leu 


Phe 


Pro 


625 






630 






I16U 


Ala 


Gin Gin 


Pro Leu 


Ser 


His 








645 






Thr 




Phe Leu 


Gin Ala 


Ala 


Glu 






660 








Leu 


Ser 


Trp Ala Asp Arg 


Leu 


Ser 






675 






680 


Tm 


Leu 


Val Val 


Leu Leu 


Ala 


Met 




690 






695 




XiJJ 


lyr 


Leu Val 


Ala Phe 


Pro 


Pro 


705 






710 








Pro 


Thr Glu 


Ala Leu 


V d-L 


His 








725 








ul jr 


Leu Ala 


His Ala 


X liX 


A cs-n 

nell 






740 








Leu 


Glv 


Thr Phe 


Leu Val 


Arc 


Ser 






755 






760 






Leu Thr 


Phe Ala 


Met 


Leu 




770 






775 




JET AAC 


Val 

V Ct X 


Pro Leu 


Leu Ala 


Asn 


Val 


785 






790 






oin 


1*1 CU 


Gly Ala Leu Leu 












805 






His 


Leu 


Pro Arg Cys Tyr 


Leu 


Leu 






820 








Pro 


Qlu 


Phe Phe Leu Gly 


Gly 


Gly 






835 






840 


Asp 


Gly 


Asn Thr Gly Asn 


Gin 


Gly 




850 






855 





<210> 20 
<211> 841 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial 
synthetic construct 



<400> 20 












Met Leu 


Leu 


Cys 


Thr 


Ala 


Arg 


Leu 


1 






5 








Cys Cys 


Trp 


Ala 
20 


Phe 


Ala 


Cys 


His 


Thr Leu 


Pro 
35 


Gly 


Asp 


Tyr 


Leu 


Leu 
40 


Gly Cys 


Leu 


Gin 


Val 


Arg 


His 


Arg 


50 










55 




Ser Cys 


Ser 


Phe 


Asn 


Glu 


His 


Gly 


65 








70 






Leu Gly 


Val 


Glu 


Glu 
85 


lie 


Asn 


Asn 


Thr Leu 


Gly 


Tyr 
100 


Gin 


Leu 


Tyr 


Asp 



Gly 


Leu 


Val 


Cys 


Leu 


Gly Leu 


Val 








620 








Glv 


Gin 


Pro 


Ser 


Pro 


Ala Arg 


Cvs 






635 








640 


Leu 


Pro 


Leu 


Thr 


Glv 


Cvs Leu 


Ser 




650 








655 




He 


Phe 


Val 


Glu 


Ser 


Glu Leu 


Pro 


665 










670 




Glv 


Cvs 


Leu 


Arcr 


Glv 


Pro Trp 


Ala 










685 






Leu 


Val 


Glu 


Val 


Ala 


Leu Cys 


Thr 








700 








Glu 


Val 


Val 


Thr 


Asp 


Trp His 


Met 






715 








720 






Thr 


7\ yet 


uCl 


Trr> Val 
up vox 


OCi 




730 








735 




Ala 


Thr 


Leu 


Ala 


Phe 


XJ \Z* Li Nw y & 


Phe 


745 










750 




Gin 


Pro 


Glv 




Tvr 


Son Ara 


Ala 










765 






Ala 


Tvr 

* Jr *■ 


Phe 


He 


Thr 


Trp Vai 


Ser 








780 








Gin 


Val 


Val 


Leu 




Pro Ala 


Val 






795 








800 


Val 


Leu 


Gly 


He 


Leu 


Ala Ala 


Phe 




810 








815 




Met 


Arg 


Gin 


Pro 


Gly 


Leu Asn 


Thr 


825 










830 




Pro 


Gly 


Asp 


Ala 


Gin 


Gly Gin 


Asn 










845 






Lys 


His 


Glu 











Sequence / note = 



Val 


Gly 


Leu 


Gin 


Leu 


Leu He 


Ser 




10 








15 




Ser 


Thr 


Glu 


Ser 


Ser 


Pro Asp 


Phe 


25 










30 




Ala 


Gly 


Leu 


Phe 


Pro 


Leu His 


Ser 










45 






Pro 


Glu 


Val 


Thr 


Leu 


Cys Asp 


Arg 








60 








Tyr 


His 


Leu 


Phe 


Gin 


Ala Met 


Arg 






75 








80 


Ser 


Thr 


Ala 


Leu 


Leu 


Pro Asn 


He 




90 








95 




Val 


Cys 


Ser 


Asp Ser 


Ala Asn 


val 


105 










110 
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Tyr Ala Thr Leu Arg Val Leu Ser Leu Pro Gly Gin His His lie Glu 

115 120 125 

Leu Gin Gly Asp Leu Leu His Tyr Ser Pro Thr Val Leu Ala Val lie 

130 135 140 

Gly Pro Asp Ser Thr Asn Arg Ala Ala Thr Thr Ala Ala Leu Leu Ser 
145 150 155 160 

Pro Phe Leu Val Pro Met lie Ser Tyr Ala Ala Ser Ser Glu Thr Leu 

165 170 175 

Ser Val Lys Arg Gin Tyr Pro Ser Phe Leu Arg Thr lie Pro Asn Asp 

180 185 190 

Lys Tyr Gin Val Glu Thr Met Val Leu Leu Leu Gin Lys Phe Gly Trp 

195 200 205 

Thr Trp lie Ser Leu Val Gly Ser Ser Asp Asp Tyr Gly Gin Leu Gly 

210 215 220 

Val Gin Ala Leu Glu Asn Gin Ala Thr Gly Gin Gly lie Cys lie Ala 
225 230 235 240 

Phe Lys Asp lie Met Pro Phe Ser Ala Gin Val Gly Asp Glu Arg Met 

245 250 255 

Gin Cys Leu Met Arg His Leu Ala Gin Ala Gly Ala Thr Val Val Val 

260 265 270 

Val Phe Ser Ser Arg Gin Leu Ala Arg Val Phe Phe Glu Ser Val Val 

275 280 285 

Leu Thr Asn Leu Thr Gly Lys Val Trp Val Ala Ser Glu Ala Trp Ala 

290 295 300 

Leu Ser Arg His lie Thr Gly Val Pro Gly lie Gin Arg lie Gly Met 
305 310 315 320 

Val Leu Gly Val Ala lie Gin Lys Arg Ala Val Pro Gly Leu Lys Ala 

325 330 335 

Phe Glu Glu Ala Tyr Ala Arg Ala Asp Lys Lys Ala Pro Arg Pro Cys 

340 345 350 

His Lys Gly Ser Trp Cys Ser Ser Asn Gin Leu Cys Arg Glu Cys Gin 

355 360 365 

Ala Phe Met Ala His Thr Met Pro Lys Leu Lys Ala Phe Ser Met Ser 

370 375 380 

Ser Ala Tyr Asn Ala Tyr Arg Ala Val Tyr Ala Val Ala His Gly Leu 
385 3.90 395 400 

His Gin Leu Leu Gly Cys Ala Ser Gly Ala Cys Ser Arg Gly Arg Val 

405 410 415 

Tyr Pro Trp Gin Leu Leu Glu Gin lie His Lys Val His Phe Leu Leu 

420 425 430 

His Lys Asp Thr Val Ala Phe Asn Asp Asn Arg Asp Pro Leu Ser Ser 

435 440 445 

Tyr Asn lie lie Ala Trp Asp Trp Asn Gly Pro Lys Trp Thr Phe Thr 

450 " 455 460 

Val Leu Gly Ser Ser Thr Trp Ser Pro Val Gin Leu Asn lie Asn Glu 
465 470 475 480 

Thr Lys lie Gin Trp His Gly Lys Asp Asn Gin Val Pro Lys Ser Val 

485 ** 490 495 

Cys Ser Ser Asp Cys Leu Glu Gly His Gin Arg Val Val Thr Gly Phe 

500 505 510 

His His Cys Cys Phe Glu Cys Val Pro Cys Gly Ala Gly Thr Phe Leu 

515 520 525 

Asn Lys Ser Asp Leu Tyr Arg Cys Gin Pro Cys Gly Lys Glu Glu Trp 

530 535 540 

Ala Pro Glu Gly Ser Gin Thr Cys Phe Pro Arg Thr Val Val Phe Leu 
545 550 555 560 

Glu Trp His Glu Pro He Ser Leu Val Leu He Ala Ala Asn Thr Leu 
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565 570 575 

Leu Leu Leu Leu Leu Val Gly Thr Ala Gly Leu Phe Ala Trp His Phe 

580 585 590 

His Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu Met 

595 600 605 

Leu Gly Ser Leu Val Ala Gly Ser Cys Ser Phe Tyr Ser Phe Phe Gly 

610 615 620 

Glu Pro Thr Val Pro Ala Cys Leu Leu Arg Gin Pro Leu Phe Ser Leu 
625 630 635 640 

Gly Phe Ala lie Phe Leu Ser Cys Leu Thr lie Arg Ser Phe Gin Leu 

645 650 655 

Val He He Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr Arg Thr 

660 665 670 

Trp Ala Gin Asn His Gly Ala Gly Leu Phe Val He Val Ser Ser Thr 

675 680 685 

Val His Leu Leu He Cys Leu Thr Trp Leu Val Met Trp Thr Pro Arg 

690 695 700 

Pro Thr Arg Glu Tyr Gin Arg Phe Pro His Leu Val He Leu Glu Cys 
705 710 715 720 

Thr Glu Val Asn Ser Val Gly Phe Leu Leu Ala Phe Thr His Asn He 

725 730 735 

Leu Leu Ser He Ser Thr Phe Val Cys Ser Tyr Leu Gly Lys Glu Leu 

740 745 750 

Pro Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Leu 

755 760 765 

Asn Phe Val Ser Trp He Ala Phe Phe Thr Met Ala Ser He Tyr Gin 

770 775 780 

Gly Ser Tyr Leu Pro Ala Val Asn Val Leu Ala Gly Leu Thr Thr Leu 
785 790 795 800 

Ser Gly Gly Phe Ser Gly Tyr Phe Leu Pro Lys Cys Tyr Val He Leu 

805 810 815 

Cys Arg Pro Glu Leu Asn Asn Thr Glu His Phe Gin Ala Ser He Gin 

820 825 830 

ABp Tyr Thr Arg Arg Cys Gly Thr Thr 
835 840 

<210> 21 
<211> 840 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 



<400> 21 














Met Leu Phe 


Trp 


Ala Ala His 


Leu Leu 


Leu 


Ser 


Leu Gin Leu Val Tyr 


1 




5 




10 




15 


Cys Trp Ala 


Phe 


Ser Cys Gin 


Arg Thr 


Glu 


Ser 


Ser Pro Gly Phe Ser 




20 




25 






30 


Leu Pro Gly 


Asp 


Phe Leu Leu 


Ala Gly 


Leu 


Phe 


Ser Leu His Gly Asp 


35 






40 






45 


Cys Leu Gin 


Val 


Arg His Arg 


Pro Leu 


Val 


Thr 


Ser Cys Asp Arg Pro 


50 




55 








60 


Asp Ser Phe 


Asn 


Gly His Gly 


Tyr His 


Leu 


Phe 


Gin Ala Met Arg Phe 


65 




70 






75 


80 


Thr Val Glu 


Glu 


He Asn Asn 


Ser Ser 


Ala 


Leu 


Leu Pro Asn He Thr 
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Leu 


Gly 


Tyr 


Glu 








100 


Ala 


Thr 


Leu 


Arg 






115 




Gin 


LVS 


Asp 


Leu 




130 






Pro 


Asp 


Asn 


Thr 


145 








Phe 


Leu 


Met 


Pro 


Ala 


Lys 


Arg 


Lys 








180 


Wis 
had 


Gin 

V7X11 


Val 

V CLX 


Glu 

VJX U. 






X ^ J 




Tim 


lie 


Ser 


Leu 




210 






Gin 


Ala 


Leu 


Glu 


c> & -J 








Lys 


Asp 


lie 


Val 


Ser 


Met 


Met 


Gin 








260 




Ser 


Asn 


Aircr 
«xy 










Ala 


Asn 


Leu 


Thr 




290 






Ser 


Thr 


Tvt 


lie 


305 








Leu 


Gly 


Val 


Ala 


Glu 


Glu 


Ser 


Tyr 








340 


Clin 

WX U 


Gly 


Ser 


Ttt> 






'ICC 
J J J 






Thr 
i hi. 


Thr* 

X 1X1. 


**xy 




J / u 






Ala 


Tvr 


Ara 


Val 


385 








Gin 


Leu 


Leu 


Gly 




fry) 


Gin 


Leu 








420 


Glu 


Asn 


Thr 


Val 






435 




Asp 


lie 


lie 


Ala 




** J u 






Tip 

lie 


vjxy 




a1 a 


465 








Lys 


lie 


Gin 


Trp 


Thr 


Thr 


Asp 


Cys 








500 


His 


Cys 


Cys 


Phe 






515 





Leu 


Tyr 


Asp 


Val 


Val 


Leu 


Ala 


Leu 








120 


Arc? 


Asn 


His 


Ser 






135 




Asp 


His 


Ala 


Val 




150 






Leu 


Val 


Ser 


TYr 


165 








Phe 


Pro 


Ser 


Phe 


Val 


Met 


Val 


Gin 








200 


He 


Glv 

\jxy 


Ser 


Tvr 






215 




Glu 


Leu 


Ala 


Val 




230 






Pro 


Phe 


Ser 


Ala 


245 








His 


Leu 


Ala 


Gin 


His 


Leu 


Ala 


Arg 








280 


Glv 


Lys 


Val 


Tro 






295 




Thr 


Ser 


Val 


Thr 




310 






Val 


Gin 


Gin 


Arc? 


325 








Val 


Arcr 


Ala 


Val 


Cys 


Ser 


Thr 


Asn 








360 


Asn 


Met 


Pro 


Thr 






375 




TVT 


Glu 


Ala 


Val 




390 






Cys 


Thr 


Ser 


Glu 


405 








Leu 


Gin 


Gin 


He 


Ala 


Phe 


Asp 


Asp 








440 


Trp 


Asp 


Trp 


Asn 






455 




Ser 


Leu 


Ser 


Pro 




470 






His 


Gly 


Lys 


Asn 


485 








Leu 


Ala 


Gly 


His 


Glu 


Cys 


Val 


Pro 



520 





90 






Cvs 


Ser 


Glu 


Ser 


105 








Gin 


Gly 


Pro 


Ara 


Ser 


Lys 


Val 


Val 








140 


Thr 


Thr 


Ala 


Ala 






155 




Glu 


Ala 


Ser 


Ser 




170 






Leu 


**xy 


Thr 


Val 


185 








JJCU 


JJCU 


Gin 


Rial* 


Glv 
v*xy 


Asp 


Tvt* 
Ay x 


Glv 

vjxy 








220 


Pro 


Arg 


Gly 


He 






235 






Val 


Glv 


Asp 




250 






Ala 


/ixy 


Thr 


Thr 


265 








V CLX 


Phe 


Phe 


«xy 


Val 


Ala 


Ser 


Glu 








300 


Gly 


He 


Gin 


Gly 






315 




Gin 


Val 


Pro 


Glv 

vjxy 




330 






Thr 


Ala 


Ala 


Pro 


345 








Gin 


Leu 


Cys 


/*xy 


Leu 


Glv 
uxy 


Ala 


Phe 








380 


Tyr 


Ala 


Val 


Ala 






395 




He 


Cys 


Ser 


Arg 




410 






Tvt 

Jr 


Lvs 


Val 


Asn 


425 








Asn 


Glv 


Asp. 


Thr 


Gly 


Pro 


Glu 


Trp 








460 


Val 


His 


Leu 


Asp 






475 




Asn 


Gin 


Val 


Pro 




490 






His 


Arg 


Val 


Val 


505 








Cys 


Glu 


Ala 


Gly 







95 




Ala 


Asn 


Val 


Tvr 




110 






His 


He 


Glu 


He 


125 








Ala 


Phe 


He 


Gly 


Leu 


Leu 


Gly 


Pro 








160 


Val 


Val 


Leu 


Ser 






175 




JET X W 


Ser 


Asp 






190 






XT 11C 


Glv 
v*xy 




Val 


205 








Gin 

will 


Leu 


Glv 
oxy 


Val 


Cys 


Val 


Ala 


Phe 








240 


Pro 


Arg 


Met 


Gin 






255 




Val 


Val 


Val 


Val 




270 






Ser 


Val 


Val 


Leu 


285 








Asp 




Ala 


He 


He 


Glv 

vjxy 


Thr 


Val 








320 / 


Leu 


Lys 


Glu 


Phe 






335 




Ser 


Ala 


Cys 


Pro 




350 






ox u. 


^»y 


His 


Thr 


365 








Ser 


Met 


Ser 


Ala 


xix a 


Gl v 
wxy 


DC u. 


His 

£1X EJ 








400 


Glv 
uxy 


Pro 


Val 


Tvr 
j.yx 






415 




Phe 


Leu 


Leu 


His 




430 






Leu 


Glv 


Tvr 
j 


Tvr 


445 








Thr 


Phe 


Glu 


He 


lie 


A an 
Moil 


T .WQ 

xiy o 


Thr 

X11X 








480 


Val 


Ser 


Val 


Cys 






495 




Val 


Gly 


Ser 


His 




510 






Thr 


Phe 


Leu 


Asn 



525 
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Met 


Ser 


Glu 


Leu 


His 


He 


Cys 


Gin 


Pro 


Cys 


Gly 


Thr Glu 


Glu 


Trp Ala 




530 










535 










540 






Pro 


Lys 


Glu 


Ser 


Thr 


Thr 


Cys 


Phe 


Pro 


Arg 


Thr 


Val Glu 


Phe 


Leu Glu 


545 










550 










555 






560 


Leu 


Arg 


Glu 


His 


Thr 


Ser 


Trp 


Val 


Leu 


Leu 


Ala 


Ala Asn 


Thr 


Leu Leu 










565 










570 








575 


Leu 


Leu 


Leu 


Leu 


Leu 


Gly 


Thr 


Ala 


Gly 


Leu 


Phe 


Ala Trp 


His 


Leu Asp 








580 










585 








590 




Thr 


Pro 


Val 


Val 


Arg 


Ser 


Ala 


Gly 


Gly 


Arg 


Leu 


Cys Phe 


Leu 


Met Leu 






595 










.600 








605 






Gly 


Ser 


Leu 


Ala 


Ala 


Gly 


Ser 


Gly 


Ser 


Leu 


Tyr 


Gly Phe 


Phe 


Gly Glu 




610 










615 










620 






Pro 


Thr 


Arg 


Pro 


Ala 


Cys 


Leu 


Leu 


Arg 


Gin 


Ala 


Leu Phe 


Ala 


Leu Gly 


625 










630 










635 






640 


Phe 


Thr 


He 


Phe 


Leu 


Ser 


Cys 


Leu 


Thr 


Val 


Arg 


Ser Phe 


Gin 


Leu He 










645 










650 








655 


He 


He 


Phe 


Lys 


Phe 


Ser 


Thr 


Lys 


Val 


Pro 


Thr 


Phe Tyr 


His 


Ala Trp 








660 










665 








670 




Val 


Gin 


Asn 


His 


Gly 


Ala 


Gly 


Leu 


Phe 


Val 


Met 


He Ser 


Ser 


Ala Ala 






675 










680 








685 






Gin 


Leu 


Leu 


He 


Cys 


Leu 


Thr 


Trp 


Leu 


Val 


Val 


Trp Thr 


Pro 


Leu Pro 




690 










695 










700 






Ala 


Arg 


Glu 


Tyr 


Gin 


Arg 


Phe 


Pro 


His 


Leu 


Val 


Met Leu 


Glu 


Cys Thr 


705 










710 










715 






720 


Glu 


Thr 


Asn 


Ser 


Leu 


Gly 


Phe 


He 


Leu 


Ala 


Phe 


Leu Tyr 


Asn 


Gly Leu 










725 










730 








735 


»Leu 


Ser 


He 


Ser 


Ala 


Phe 


Ala 


Cys 


Ser 


Tyr 


Leu 


Gly Lys 


Asp 


Leu Pro 








740 










745 








750 




Glu 


Asn 


Tvr 


Asn 


Glu 


Ala 


Lys 


Cys 


Val 


Thr 


Phe 


Ser Leu 


Leu 


Phe Asn 






755 










760 








765 






Phe 


Val 


Ser 


Trp 


He 


Ala 


Phe 


Phe 


Thr 


Thr 


Ala 


Ser Val 


Tyr 


Asp Gly 




770 










775 










780 






Lys 


Tyr 


Leu 


Pro 


Ala 


Ala 


Asn 


Met 


Met 


Ala 


Gly 


Leu Ser 


Ser 


Leu Ser 


785 










790 










795 






800 


Ser 


Gly 


Phe 


Gly 


Gly 


Tyr 


Phe 


Leu 


Pro 


Lys 


Cys 


Tyr Val 


He 


Leu Cys 










805 










810 








815 


Arg 


Pro 


Asp 


Leu 


Asn 


Ser 


Thr 


Glu 


His 


Phe 


Gin 


Ala Ser 


He 


Gin Asp 








820 










825 








830 




Tyr 


Thr 


Arg 


Arg 


Cys 


Gly 


Ser 


Thr 















835 840 



<210> 22 
<211> 838 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 22 

Met Gly Pro Arg Ala Lys Thr He Cys Ser Leu Phe Phe Leu Leu Trp 

1 " 5 10 15 

Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 

20 25 30 

Tyr Leu Leu Gly Gly Leu Phe . Ser Leu His Ala Asn Met Lys Gly He 
35 40 45 
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Val 


His 


Leu 


Asn 


Phe 


Leu 


Gin 


Val 


Pro 


Met Cys 


Lvs 


Glu 


Tyr 


Glu Val 




50 










55 








60 








Lys 


Val 


He 


Glv 


Tvi? 
xyx 


Asn 


Leu 


Met 


Gin 


Ala Met 


Ara 


Phe 


Ala 


Val Glu 


65 










70 








75 








80 


Glu 


lie 


As ii 


Asn 


Asp 


Ser 


Ser 


Leu 


Leu 


Pro Gly 


Val 


Leu 


Leu 


Gly Tyr 










85 










90 








95 


Glu 


lie 


Val 


ASD 


Val 


Cvs 


Tyr 


He 


Ser 


Asn . Asn 


Val 


Gin 


Pro 


Val Leu 








100 










105 








110 




Tvr 


Phe 


Leu 


Ala 


His 


Glu 


ASD 


Asn 


Leu 


Leu Pro 


He 


Gin 


Glu 


Asp Tyr 






115 










120 








125 






Ser 


As n 


Tvi" 
xyx 


He 


Ser 


Ara 


Val 


Val 


Ala 


Val He 


Gly 


Pro 


Asd 


Asn Ser 




130 










135 








140 








Glu 


Ser 


Val 


Met 


Thr 


Val 


Ala 


Asn 


Phe 


Leu Ser 


Leu 


Phe 


Leu 


Leu Pro 


145 










150 








155 








160 


Gin 


lie 


Thr 


xyx. 


Ser 


Ala 


He 


Ser 


Asp 


Glu Leu 


Arcr 


Asd 

""XT 


Lvs 

m jr o 


Val Arg 










165 










170 








175 


Phe 


Pro 


Ala 


Leu 


Leu 


Ara 


Thr 


Thr 


Pro 


Ser Ala 


Asp 


His 


His 


Val Glu 








180 










185 








190 




Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 


Phe 


Arcr Tm 


Asn 


Trrj 


He 


He Val 






195 










200 








205 






Leu 


Val 


Ser 


Ser 


Asp 


Thr 


Tvr 

xyx 


Glv 


Arcr 


Asd Asn 


Glv 


Gin 


Leu 


Leu Gly 




210 










215 








220 








vj J- Li 


Arg 


val 


A 1 Sri 


At*<tt 


Arcr 

*vtrg 


Bar* 


He 




He Ala 


Phe 


Gin 


Glu 


Thr Leu 


225 










230 








235 








240 


If X \J 




xjcu 


G 1 n 


Prn 

XT X U 


Asn 


Gin 


Asn 


Met 


Thr Ser 


Glu 


Glu 


Arg Gin Arg 










245 










250 








255 


Leu 


Val 


Thr 


He 


Val 


Asp 


Lys 


Leu 


Gin 


Gin Ser 


Thr 


Ala 


Arg Val Val 








260 










265 








270 




Val 


Val 


Phe 


Ser 


Pro 


Asp 


Leu 


Thr 


Leu 


Tyr His 


Phe 


Phe 


Asn 


Glu Val 






275 










280 








285 






Leu 




Gin 


Asn 


Phe 


Thr 


Glv 


Ala 


Val 


Trp He 


Ala 


Ser 


Glu 


Ser Trp 




290 










295 








300 








Ala 


lie 


Asp 


Pro 


Val 


Leu 


His 


Asn 


Leu 


Thr Glu 


Leu 


Glv 


His Leu Gly 


305 










310 








315 








320 


Thr* 

XIX J. 


Phe 


Tien 


Glv 

>J JL Jf 


He 


Thr 


He 


Gin 


Ser 


Val Pro 


He 


Pro 


Gly Phe Ser 










325 










330 








335 


Glu 


Phe 




Glu 


xxlv 


Glv 


Pro 


Gin 


Ala 


Gly Pro 


Pro 


Pro 


Leu 


Ser Arg 








340 










345 








350 




Thy 

XXXX 


OCX 


Gin 

WJLXX 


Ser 


xyx 


Thr 


Cys 


Asn 


Gin 


Glu Cvs 

V»J JU U4 W Jr 


Asp 


Asn 


Cys 


Leu Asn 






355 










360 








365 






nlo 


Thr 


Leu 


Ser 


Phe 


Asn 


Thr 


He 


Leu 


Ara Leu 


Ser 


Glv 


Glu Arg Val 




370 










375 








380 








Val 


Tvy/ 
xyi 


Ser 


Val 


Tvr 
xyx 


Ser 


Ala 


Val 


Tvr 


Ala Val 


Ala 


His 


Ala 


Leu His . 


385 








390 








395 








400 


Ser 


Leu 


Leu 


Glv 


Cys 


Asp 


Lys 


Ser 


Thr 


Cys Thr 


Lvs 


Ara 


Val 


Val Tyr 










405 










410 








415 


Pro 


xxp 


Gin 


Leu 


Leu 


Glu 


Glu 


He 


Tru 


Lys Val 


Asn 


Phe 


Thr 


Leu Leu 








420 










425 








430 




nop 


(IX o 


Gin 


He 

lie 


Phe 


Phe 


.Asp 


Pro 


Gin 


Gly Asp 


Val 


Ala 


Leu His Leu 






435 










440 








445 






Glu 


He 


Val 


Gin 


Trp 


Gin 


Trp 


Asp 


Arg 


Ser Gin 


Asn 


Pro 


Phe 


Gin Ser 




450 










455 








460 








Val 


Ala 


Ser 


Tyr 


Tyr 


Pro 


Leu 


Gin 


Arg 


Gin Leu 


Lys 


Asn 


He Gin Asp 


465 










470 








475 








480 


lie 


Ser 


Trp 


His 


Thr 


Val 


Asn 


Asn 


Thr 


He Pro 


Met 


Ser 


Met 


Cys Ser 








485 










490 








495 


Lys 


Arg 


Cys 


Gin 


Ser 


Gly 


Gin 


Lys 


Lys 


Lys Pro 


Val 


Gly 


He His Val 
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500 



Cys 


Cys 


Phe 


Glu 


Cys 


He 


Asp 


Cys 






515 










520 


Thr 


Glu 


Asp 


Glu 


Tyr 


Glu 


Cys 


Gin 




530 










535 




Tyr 


Gin 


Ser 


Glu 


Thr 


Ser 


Cys 


Phe 


545 










550 






His 


Glu 


Val 


Pro 


Thr 


He 


Val 


Val 










565 








Phe 


Ser 


Thr 


Leu 


Ala 


He 


Leu 


Phe 








580 










Pro 


Met 


Val 


Arcr 


Ser 


Ala 


Gly 


Gly 






595 










600 


Pro 


Leu 


Leu 


Leu 


Ala 


Phe 


Gly 


Met 




610 










615 




Thr 


Val 


Phe 


Ser 


Cys 


Phe 


Cys 


Ara 


625 










630 






Ser 


lie 


Cvs 


Leu 


Ser 


Cvs 


He 


Thr 










645 








Val 


Phe 


Lys 


Met 


Ala 


Ara 


Ara 


Leu 








660 










Arg 


Tyr 


His 


Glv 


Pro 


Tvr 


Val 


Phe 






675 










680 


Val 


Ala 


Leu 


Val 


Val 


Glv 


Asn 


Met 




690 










695 




Gly Arg 


Thr 


Asp 


Pro 


Asp 


Asp 


Pro 


705 










710 






Pro 


Asn 


Tvr 


Ara 


Asn 


Glv 


Leu 


Leu 










725 








Leu 


Ser 


Val 


Leu 


Glv 


Phe 


Ser 


Phe 








740 










Thr 


Asn 


Tvr 


Asn 


Glu 


Ala 


Lys 


Phe 






75 5 










760 


Phe 


Thr 


Ser 


Ser 


He 


Ser 


Leu 


Cvs 




770 










775 




Val 


Leu 


Val 


Thr 


He 


Met 


Asp 


Leu 


785 










790 






Ala 


He 


Gly 


Leu 


Gly 


Tyr 


Phe 


Gly 










805 








Tyr 


Pro 


Glu 


Arg 


Asn 


Thr 


Ser 


Ala 








820 










Tyr Thr 


Met 


Arg 


Lys 


Ser 







835 



505 






510 


Leu 


Pro Gly 


Thr Phe Leu Asn His 








525 


Ala 


Cys 


Pro 


Asn Asn Glu Trp Ser 








540 


Lvs 


Arg Gin 


Leu Val Phe Leu Glu 






555 


560 


Ala 


He 


Leu 


Ala Ala Leu Glv Phe 




570 




575 


He 


Phe 


Trp 


jiyct His Phe Gin Thr 


585 






590 


Pro 


Met 


Cys 


Phe Leu Met Leu Val 








605 


Val 


Pro 


Val 


Tvr Va 1 Ol v Pt*o PtT) 
i/i veil vjijr nw xr *-\J 








620 


Gin 


Ala 


Phe 


Phe Thr Val Cvs Phe 

cue x in v ax o rue 






635 


640 


Val 


Arg Ser 


Phe Gin Tie Val Pva 




650 




655 

OSS 


p rn 

Xri 


Ser Ala 


TVr-v- Qar PVlO TTm 

lyi Ocl xr lie lip rlcU 








670 


v ai 


Ala 


Phe 


Tie Thr* Ma Tie T.vh 
ne in jl Ala lie J_>yt3 








685 


Ti£&11 
JJC u 


Ala 


Thr 


J. Ill lie noil riu lie 








700 


Asn 


He 


Met 


Tie Leu Ser* f*vs Wis 






715 


720 


Phe 


Asn 


Thr 


Ser Met Asn Leu Leu 




730 




735 


Ala 


Tyr Met 


Ol v TiVfl nl ii Tien Drn 
wijr u jr o will ueu riu 


745 






750 


Tie 


Thr 


Leu 


cpr Met - TTiT* PVte Qor 
oci ncu x ill rue oei 








765 


J. Hi 


Phe 


Met 


oci Val xliS /VBp VJly 








780 


Leu 


Val 


Thr 


Val Leu Asn Phe Leu 






795 


800 


Pro 


Lys 


Cys 


Tyr Met lie Leu Phe 




810 




815 


Tyr 


Phe 


Asn 


Ser Met He Gin Gly 


825 






830 



<210> 23 
<211> 843 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note o 
synthetic construct 

<400> 23 

Met Gly Pro Gin Ala Arg Thr Leu Cys Leu Leu Ser Leu Leu Leu His 

1 5 .10 15 

Val Leu Pro Lys Pro Gly Lys Leu Val Glu Asn Ser Asp Phe His Leu 
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20 



Ala 


vjiy 


Asp 


Tyr 


Leu 


Leu 


wiy oiy 






35 








40 


Lys 


Ser 


lie 


Ser 


His 


Leu 


Ser Tyr 




50 










55 


Pne 


Thr 


Met 


Lys 


vai 


Leu 


Gly Tyr 


65 










70 




Ala 


Val 


Glu 


Glu 


lie 


Asn 


Asn Cys 










85 






Leu 


Gly 


Tyr 


Glu 


Met 


Val 


Asp Val 








100 








Pro 


Gly 


Leu 


Tyr 


Phe 


Leu 


Ala Gin 






115 








120 


Lys 


Asp 


Tyr 


Ser 


Gin 


Tyr 


Met Pro 




130 










135 


Asp 


Asn 


Ser 


Glu 


Ser 


Ala 


lie Thr 


145 










150 




Leu 


lie 


Pro 


Gin 


lie 


Thr 


Tyr Ser 










165 






Lys 


Arg 


His 


Phe 


Pro 


Ser 


Met Leu 








180 








His 


lie 


Glu 


Ala 


Met 


Val 


Gin Leu 






195 








200 


lie 


Val 


Val 


Leu 


Val 


Ser 


Asp Asp 




210 










215 


Leu 


Leu 


Ser 


Gin 


Arg 


Leu 


Thr Lys 


225 










230 




Gin 


Glu 


Val 


Leu 


Pro 


He 


Pro Glu 










245 






Qlu 


Gin 


Arg 


Gin 


Leu 


Asp 


Asn lie 








260 








Ala 


Arg 


Val 


Val 


Val 


Val 


Phe Ser 






275 








280 


Phe 


His 


Glu 


Val 


Leu 


Arg 


Trp Asn 




290 










295 


Ser 


Glu 


Ser 


Trp 


Ala 


He 


Asp Pro 


305 










310 




Arg 


His 


Thr 


Gly 


Thr 


Phe 


Leu Gly. 










325 






Pro 


Gly 


Phe 


Ser 


Gin 


Phe 


Arg Val 








340 








Val 


Pro 


Asn 


Thr 


Thr 


Asn 


Leu Arg 






355 








360 


Ala 


Cys 


Leu 


Asn 


Thr 


Thr 


Lys Ser 




370 










375 


Gly 


Glu 


Arg 


Val 


Val 


Tyr 


Ser Val 


385 










390 




His 


Ala 


Leu 


His 


Arg 


Leu 


Leu Gly 










405 






Gin 


Lys 


Val 


Tyr 


Pro 


Trp 


Gin Leu 








420 








Phe 


Thr 


Leu 


Leu 


Gly 


Asn 


Arg Leu 






435 








440 


Pro 


Met 


Leu 


Leu 


Asp 


He 


He Gin 




450 










455 


Pro 


Phe 


Gin 


Ser 


He 


Ala 


Ser Tyr 


465 










470 





25 






30 




Leu 


Phe Thr Leu 


HIS 


Ala 


Asn Val 






45 






Leu Gin Val Pro 


Lys 


Cys 


Asn Glu 




60 








Asn 


Leu Met Gin 


Ala 


Met 


Arg Phe 




75 






80 


Ser 


Ser Leu Leu 


Pro 


Gly 


Val Leu 




90 






95 


Cys 


Tyr Leu Ser 


Asn 


Asn 


He His 


105 






110 




Asp Asp Asp Leu 


Leu 


Pro 


He Leu 






125 






His 


Val Val Ala 


Val 


He 


Gly Pro 




140 








Val 


Ser Asn He 


Leu 


Ser 


His Phe 




155 






160 


Ala 


He Ser Asp 


Lys 


Leu 


Arg Asp 




170 






175 


Arg 


Thr Val Pro 


Ser 


Ala 


Thr His 


185 






190 




Met 


Val His Phe 


Gin 


Trp 


Asn Trp 






205 






Asp Tyr Gly Arg 


Glu 


Asn 


Ser His 




220 








Thr 


Ser Asp He 


Cys 


He 


Ala Phe 




235 






240 


Ser 


Ser Gin Val 


Met 


Arg 


Ser Glu 




250 






255 


Leu 


Asp Lys Leu 


Arg 


Arg 


Thr Ser 


265 






270 




Pro 


Glu Leu Ser 


Leu 


Tyr 


Ser Phe 






285 






Phe 


Thr Gly Phe 


Val 


Trp 


He Ala 




300 








Val 


Leu His Asn 


Leu 


Thr 


Glu Leu 




315 






320 


Val 


Thr He Gin 


Arg 


Val 


Ser He 




330 






335 


Arg 


Arg Asp Lys 


Pro 


Gly 


Tyr Pro 


345 






350 




Thr 


Thr Cys Asn 


Gin 


Asp 


Cys Asp 






365 






Phe 


Asn ABn He 


Leu 


He 


Leu Ser 




380 








Tyr 


Ser Ala Val 


Tyr 


Ala 


Val Ala 




395 






400 


Cys 


Asn Arg Val 


Arg 


Cys 


Thr Lys 




410 






415 


Leu 


Arg Glu He 


Trp 


His 


Val Asn 


425 






430 




Phe 


Phe Asp Gin 


Gin 


Gly 


Asp Met 






445 






Trp 


Gin Trp Asp 


Leu 


Ser 


Gin Asn 




460 








Ser 


Pro Thr Ser 


Lys 


Arg 


Leu Thr 




475 






480 
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Tvr 


He 


Asn 


Asn 


Val 


Ser 


Tm 


TVr 

A/X 


Thr 


Pro Asn 


Asn Thr 


Val 


Pro 


Val 










485 










490 






495 




Ser 


Met 


Cys 


Ser 


Lys 


Ser 


Cys 


Gin 


Pro 


Gly Gin 


Met Lys 


Lvs 
uy 0 


Ser 


Val 








500 










505 






510 






Qly 


TiPti 
xic u 


His 


Pro 




Cys 


Phe 


Glu 


Cys 


Leu Asd 


Cvs Met 


Pro 


Glv 


Thr 






51 5 










520 






525 








Tvr 
xyx 


Tien 


Z\ fin 

AD11 


Arg 


Ser 
OCX 


Ala 


Asp 


Glu 


Phe 


Asn Cvs 


Leu Ser 


Cys 


Pro 


Glv 
vjxy 




J -J V 










535 








540 








OCX 


Met* 


irp 


Cat* 
OCX 


iyr 


Lys 


Aon 


Ann 


He 


Thr Cys 


Phe Gin 


Arg 


Arer 
Arg 


IT X w 


545 










550 








555 








560 


Thr 
nil 


tf 11C 


DC U 


Ul U 


HP KM 


His 
nx o 


Glu 

VJl u 


Ala 
nia 


Pro 


Thr He 


Ala Val 


Ala 
Ala 


xie u 


xieu 










565 










570 






575 




Ala 


Ala 


Leu 


Glv 


Phe 


Leu 


Ser 


Thr 


Leu 


Ala lie 


Leu Val 


He 


Phe 


Trn 








580 










585 






590 






Arci 


His 


Phe 


Gin 


Thr 


Pro 


He 


Val 


Arci 


Ser Ala 


Gly Gly 


Pro 


Met 


Cys 






595 










600 






605 








Phe 


Leu 


Met 


Leu 


Thr 


Leu 


Leu 


Leu 


Val 


Ala Tyr 


Met Val 


Val 


Pro 


Val 




610 










615 








620 










V CLX 


fll V 




riu 


xiy o 


Val 
v ax 


Cpr 
OCX 


Thr 

1 11X 


o xieu. 


C*\ra Arci 


Gl n 

will 


Al a 
Aid 


XI C Ll 


O A J 


















635 








640 


XT 11C 


rl \J 


UC LL 


>— jr ES 


rr lie 


Thr 
mi 


Tl e 

11C 


L> jr a 


Tie 

X1C 


ocx uyo 


Tie Ala 

X X C nlct 


Val 
v ax 


Arg 


OCX 










645 










650 






655 




.rile 


Pi _ 
oJ.Il 


Tl o 

lie 


Vol 


Cys 


1\1 a 
Ax a 


flic 


Lys 


rlcu 


Ala Ser 


A vrr Dhp 

Hiy file 


Pro 


Arg 


A 1 a 
Ala 








DOU 






















Tyr 


Ser 


Tyr 


irp 


Val 
v ai 


Arg 


Ayr 


V3111 


fll V 

wiy 




Val Qpr 
vai oei 


i*ie l. 


Al a 
Aia 


Phe 






675 










680 






685 








Tip 


Thr 

J.11X ' 


Val 

V Ct X 


Tien 
XI c u 


Xt y O 


Met 


Val 


He 


Val 


Val He 


Glv Mer 

V3 1 y 1*1 C L> 


Leu 


Ala 


Thr 




690 










695 








700 








uljr 


XIC Ll 


OCX 


xr x 


XliX 


iiu 


Arg 


Thr 

11XX 


A nri 

Hop 


Pro A qti 


nop rl u 


T,VH 

xiy 0 


Tl e' 
ne 


Thr 

1111 


705 










710 








715 








720 


Tl o 
lie 


Val 
V Ul 


Ser 


v^y B 


Ann 


rlO 


Aon 
noil 


iyr 


A TTT 

Arg 


Aon Cpr 
iiDll OCX 


TiOii T1P11 
XJCU XJCU 


Jrile 


A 0 m 
noil 


Thr 

lllx 










725 










730 






735 




O CI 


XJCU 


nbLJ 


XJC Ll 


xieu 


XJC CI 


OCX 


Val 
v ax 


Val 
val 


fll • Dho 
vjiy it lie 


Opr Oh a 
ocx rue 


Al a 
nia 


iyr 


I'ICL 








740 










745 






750 








T.v/a 
xiy a 


U1U 


XI C Ll 


Pro 

xrX vj 


Thr 

X11X 


A an 

fiO 1-1 


Tvr 

iyr 


21 en 

noil 


fill it 2ila 
wiu Aid 


Xiy 0 JrllB 


Tl e 
lie 


Thr 

111X 


xieu 






755 










760 






765 








Qpr 


nc l» 


Thr 

1U1 


Phe 


lyr 


Phe 


Thr 


Ser 


Ser 


Val Ser 


Tien fva 

XIC U vjr C3 


Thr 

X11X 


Phe 


Met 
11c u 




770 










775 








780 








Ser 


Ala 


Tyr 


Ser 


Gly 


Val 


Leu 


Val 


Thr 


He Val 


Asp Leu 


Leu 


Val 


Thr 


785 










790 








795 








800 


val 


Leu 


Asn 


Leu 


Leu 


Ala 


He 


Ser 


Leu 


Gly Tyr 


Phe Gly 


Pro 


Lys 


Cys 










805 










810 






815 




Tyr 


Met 


He 


Leu 


Phe 


Tyr 


Pro 


Glu 


Arg 


Asn Thr 


Pro Ala 


Tyr 


Phe 


Asn 








820 










825 






830 






Ser 


Met 


He 


Gin 


Gly 


Tyr 


Thr 


Met 


Arg 


Arg Asp 











835 840 



<210> 24 

<211> 853 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 24 
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Met 


Leu 


Gly 


Pro 


Ala 


Val 


Leu 


Gly 


1 








5 








Pro 


Glv 
uiy 


Thr 


Glv 


Ala 


Pro 


Leu 


Cvs 








20 










uiy 


nop 


Tvr 


Val 


Leu 


Glv 
oiy 


Glv 


Leu 
















40 


Ala 


Glv 


Leu 


Ara 


Ser 


Ara 


Thr 


Arg 




50 










55 




Phe 


Sex* 


Ser 


Asn 


Glv 

vaxy 


Leu 


Leu 


Trp 


65 










70 






Glu 


Glu 


lie 


Asn 


Asn 


Lys 


Ser 


ASD 










85 








Tvr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cys 


Ser 








100 










Ser 


Leu 


Mat- 


ir lie 


LlC U. 


HI a 
Aid 


T,VB 

XJjr o 


Ala 

nil* 






113 










120 


Cys 


Asn 


Tyr 


inr 


m n 
ulu 


Tyr 


m n 


Drrt 
xrx *J 




lj u 










lit; 




TT -i n 
HIS 


Ser 


Ser 


vJXU 


Leu 


Ala 




Val 

V CLl 


x *± o 










150 






Leu 


Mat- 


Fro 


(31 n 


veil 


Car 

OCX 


xyr 


Glv 
uiy 










1 5 

ID 3 








Arg 


rsi ii 


IHX 


XT XlC 


xrl \J 


Car 
OCX 


Phe 


Phe 








1 AO 

lOV 










ni -n 


Leu 


TVi r 
XII J. 


nlo 


nla 


2\1 a 

nX OL 


(31 u 

V71 IX 


Leu 






1 95 










200 


IT- T 

vai 


ax a 


IV 1 a 

iua 


L6U 


uiy 


Car 
OCX 




Ann 




Z 1U 










215 

«13 




jl ie 


rile 


Ser 


211 a 
Aid 


Leu 


ai a 

Aid 


Ala 
ni c*. 


Ala 


<G <6 3 










230 






oxy 


ucu 


Val 


Pro 


Leu 


Pro 


«iy 


Ala 










245 








m n 

V3XI1 


ASp 


Val 


Leu 


His 


Gin 


Val 


Asn 








260 










T.oi i 
UCU 


.true 


Ala 


Owl 


Val 


His 


Ala 


Ala 






275 










280 


Cor 
OCX 


OCX 


2VTVT 

nl y 


JJCU 


Cpr 


Pro 


Lys 


Val 














295 




Thr 


Ser 


Asp 


Leu 


Vol 


Mot* 

wet 


uiy 


JJCU 












310 






vai 


Leu 


uxy 


rflc 


Leu 


OlXL 


Arg 


rjl t/ 










35 5 








Tyr 


Va 1 


Lys 


Xlix 


nib 


Leu 


Al a 

iilCl 


T.pn 








J *r VJ 










Hi a 
Al cl 


Leu 


Ijiy 


Olu 

VjX U 


Arg 


Glu 


Gin 


Glv 






355 










360 


Arg 


Cys 


Pro 


Gin 


Cys 


Asp 


Cys 


He 




370 










375 




Leu 


Asn 


His 


His 


Gin 


Thr 


Phe 


Ser 


385 










390 






Ala 


Gin 


Ala 


Leu 


His 


Asn 


Thr 


Leu 










405 








Ala 


Gin 


Asp 


Pro 


Val 


Lys 


Pro 


Trp 








420 










Leu 


Thr 


Phe 


His 


Val 


Gly 


Gly 


Leu 



Leu 


Ser 


Leu 


Trp 


Ala 


Leu 


Leu 


His 




10 










15 




Leu 


Ser 


Gin 


Gin 


Leu 


Arg 


Met 


Lys 


25 










30 






Phe 


Pro 


Leu 


Gly 


Glu 


Ala 


Glu 


Glu 










45 








Pro 


Ser 


Ser 


Pro 


Val 


Cys 


Thr 


Arg 








60 










Ala 


Leu 


Ala 


Met 


Lys 


Met 


Ala 


Val 






75 










80 


Leu 


Leu 


Pro 


Gly 


Leu 


Arg 


Leu 


Gly 




90 










95 




Glu 


Pro 


Val 


Val 


Ala 


Met 


Lys 


Pro 


105 










110 






Glv 


Ser 


Ara 


ASD 


He 


Ala 


Ala 


Tvr 










125 










Val 


Leu 


Ala 


Val 


He 


Glv 


Pro 








140 










Thr 


Glv 
vjxy 


Lys 


Phe 


Phe 


Ser 


Phe 


Phe 






155 










160 


Ala 


Ser 


Met 


Glu 


Leu 


Leu 


Ser 


Ala 




170 










175 




Ara 


Thr 


Val 


Pro 


Ser 


Asd 


Ara 


Val 


185 










190 






Leu 


Gin 


Glu 


Phe 


Glv 


Tro 


Asn 


Tna 










205 








Glu 




Glv 

V3iy 


Ara 


Gin 


Glv 


Leu 


Ser 








220 










Ara 


Glv 


He 


Cys 


He 


Ala 


His 


Glu 






235 










240 


Asp 


ASD 


Ser 


Ara 


Leu 


Gly 


Lys 


Val 




250 










255 




Gin 


Ser 


Ser 


Val 


Gin 


Val 


Val 


Leu 


265 










270 






His 


Ala 


Leu 


Phe 


Asn 


Tyr 


Ser 


He 










285 








Tro 


Val 


Ala 


Ser 


Glu 


Ala 


Tro 


Leu 








300 










Pro 


Glv 


Met 


Ala 


Gin 


Met 


Glv 

vji y 


Thr 






315 










320 


Ala 

nla 


Gin 




His 


Glu 


Phe 


Pro 


Gin 




330 










335 




Ala 


Thr 


Asp 


Pro 


Ala 


Phe 


Cys 


Ser 


345 










350 






Leu 


Glu 


Glu 


Asd 


Val 


Val 


Glv 


Gin 










365 








Thr 


Leu 


Gin 


Asn 


Val 


Ser 


Ala 


Gly 








380 










Val 


Tyr 


Ala 


Ala 


Val 


Tyr 


Ser 


Val 






395 










400 


Gin 


Cys 


Asn 


Ala 


Ser 


Gly 


Cys 


Pro 




410 










415 




Gin 


Leu 


Leu 


Glu 


Asn 


Met 


Tyr 


Asn 


425 










430 






Pro 


Leu 


Arg 


Phe 


Asp 


Ser 


Ser 


Gly 
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a«n Vai 
ABU Vol 


ASp 


l T JC u uiu ±y x 


450 






vox JrXU 


Arg 


T t eii a Bon 
ucu. xi id nop 


465 




470 


f3l n 7\ rn 
ox u niy 


UCU 


T.vp Tip 7\ >-rr 
uy o x x c *vl y 






485 


Got" Tl T~n 
OCX riry 


Cys 


OCX r\JL y VJXI1 






500 
j y v 


uiy irne 


TT -J m 

ax S 


rSffl Pira 

ocx t-ys uys 








Tyr Arg 


Gin 


Asn Fro Asp 








Glu Trp 


Ser 


Pro Glu Arg 


Cite 

54b 




CCA 


nVft T Ait 

Pne Lieu 


G1U 


Trp Gly Glu 






565 


Cys Leu 


vai 


iicu oiy xieu 






con 
sou 


Tyr Trp 


Asp 


oer fro x*eu 




CQC 




xrlie bly 


Leu 


lie LTyS lieu 


D1U 






Pro Gly 


Arg 


• 

Pro Arg Ser 


62 5 




c ^ rt 


TjJ n T All 

HxB Li&U 


Pro 


Lieu mr oiy 






04O 


uiu ne 


irfie 


\7a 1 f2l n Cor 
veil VjlU OCX 






660 

O O \J 


Cys Ser 


Tyr 


Lieu Arg vaiy 




D / 0 




inr lieu 


vai 


Pi ii Hi a ala 

uiu Aia Aia 


can 

fa y u 






Pro Glu 


vai 


Vai Thr Asp 


705 




710 


His Cys 


Arg 


Met Arg Ser 






725 


Asn Ala 


vai 


Leu Ala Phe 






I *± u 


ser Gin 


pro 


uiy Arg lyr 




•7CC 




Lieu Aia 


Tyr 


fuc lie lie 


/ / u 






vai vim 


vai 


Aia lyr uin 


TAR 
/CD 




7<50 


Cys Ala 


Leu 


Gly He Leu 






805 


Leu Leu 


Trp 


Leu Pro Glu 






820 


Ser Pro 


Lys 


Glu Ala Ser 




835 




Arg Gly 


His 


Ser Glu 


850 







<210> 25 
<211> 857 
<212> PRT 



440 



Asp Leu Lys Leu 


Trp 


Vai 


455 




460 


Vai Gly Arg Phe 


Asn 


Gly 




475 




Trp His Thr Ser 


Asp 


Asn 


490 






Cys Gin Glu Gly 


Gin 


Vai 


505 






Tyr Asp Cys Vai 


Asp 


Cvs 


• 520 






Asn Tie Ala Cvs 

no^ ^J.w m» jl 


Thr 


Phe 






540 


Ser Thr Arcr Cvs 


Phe 


Ara 




555 

J J J 




Py"o Al a Vai Leu 


Ser 


Leu 


570 






Thr Leu Ala Ala 


Leu 


Glv 


585 






Vai Gin Ala Ser 


Glv 


Glv 


600 






Gly Leu Phe Cys 


Leu 


Ser 


615 
w x J 




620 


Ala Ser Cys. Leu 


Ala 


Gin 




635. 




Cvs Leu Ser Thr 


Leu 


Phe 


650 






Glu Leu Pro Leu 


Ser 


Trp 


665 






Pro Trp Ala Trp 


Leu 


Vai 


680 






Leu Cys Ala Trp 


Tvr 


Leu 


6 Q5 




700 


Tm Ol n Vai Leu 


Pro 


Thr 




71 5 
/ xo 




XXJJ VCLX OCX XICU 


Glv 
«iy 


Tien 

XJC u 


730 






T»eu Pvs . Phe Leu 


Glv 
oxy 


Thr 


74 5 






Asn Arcr Ala Ara 


Glv 


Leu 


760 






Trp Vai Ser Phe 


Vai 


Pro 


775 




780 


Pro Ala Vai Gin 


Met 


Gly 




795 




Ala Thr Phe His 


Leu 


Pro 


810 






Leu Asn Thr Gin 


Glu 


Phe 


825 






Asp Gly Asn Ser 


Gly 


Ser 



840 



PCT/US2004/025459 

445 



Trp 


Gin 


Gly 


Ser 


Ser 


Leu 


Arg 


Thr 








480 


Gin 


Lys 


Pro 


Vai 






495 




Arq 


Arg 


Vai 


LVS 




510 






GlU 


Ala 


Glv 


Ser 


525 








Cys 


Glv 

vjo.y 


Gin 


Asp 


Arg 


Arg 


Ser 


Arg 








560 


Leu 


Leu 


Leu 


Leu 






575 




Leu 


Phe 


Vai 


His 




590 






Ser 


Leu 


Phe 


Cvs 


605 








Vai 


Leu 


Leu 


Phe 


Gin 


Pro 


Met 


Ala 








640 


Leu 


Gin 


Ala 


Ala 






655 




Ala 


Asn 


Tro 


Leu 




670 






Vai 


Leu 


Leu 


Ala 


685 








Met 


Ala 


Phe 


Pro 


Glu 


Vai 


Leu 


Glu 








720 


V CLX 


xix a 


Tl e 
x xc 


XilX 






7^5 
/ o o 




Phe 


Leu 


Vai 


Gin 




750 






Thr 


Phe 


Ala 


Met 


765 








Leu 


Tien 

JJC LA 


Ala 


Asn 


Ala 


He 


Leu 


Phe 








800 


Lys 


Cys 


Tyr 


Vai 






815 




Phe 


Leu 


Gly 


Arg 




630 






Ser 


Glu 


Ala 


Thr 



845 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 



<400> 25 



Met* 


Pro 


biy 


Leu 


al a 
Ala 


IXC 


Ti on 
JJCU 


gi v T»f>it 


Qpr 

OCX 


JJCU nia Aia 


pVia Tien Gl ti 
rue ucu uiu 


1 








O 








X \J 




19 


uc u 


gi v 




Ol v 


Ser 


Ser 


Leu 


rVfi Tieti 


Ser 


Gin Gin Phe 

wXU wXU XrllC 


Iath Ala Ol n 

J-JjT O A»JLa uiu 






















S U 


Glv 
crxy 


Hop 


ryr 


Tl o 
1 1C 


JJCU 


en v 

wljf 


Glv 

vJX jr 


TiPil Phe 


XrX KJ 


■ xicu vjrxy nix 


TVir* Gin Gl n 

X11X V3J.U ox u 






•5 O 














AC 




Ala 
/ix a 


1 ill 


T.on 

JJCU 


rtD 11 


Gin 


Aig 


Thr* 

1 11JL 


Gin Pro 


Asn 


Gl v Tie Tipn 
wxy xxc jjcu 


(Vb Thr* Ar*a 
wjtd xjjx rtiy 




en 

OU 










3D 






fin 




xrlic 


Cor 

OCX 




T.on 


v*iy 




irllc 


r.pii jxT a 

JJCU nl» 




J\1 a Mot" TiVa 
ax a i*jc u uy a 


Moh Ala Val 
i*i c u nia vax 


0 D 


















/ 3 


on 
OU 




gi ii 




Asn 


Asn 


Gl ir 


Ser 


& 1 a T.on 
Ala jjcu 


JJCU 


Dro GT TiAii 
JrxiJ u±y Ucu 


Axg jjc u biy 


















on 




' QC. 
73 


Tyr 


Asp 


Leu 


DVlO 
XT XiC 


Asp 


1I1ZV 


■ 

Cys 


Cor Gl n 


xrX <J 


Val Val Thr- 
vax vax iiix 


Mot* T ■t/'O lD"f*^*4 
wet jjyB JrxO 








i a a 
100 








i n c 
1U5 






110 


Ser 


Leu 


wet 


Fne 




Ala 


Lys 


X7al nlv 

Val wiy 


Ser 


rjln Cor Tl o 

uin oer lie 


Aia Aia lyr 






115 










120 




IOC 

125 




Cys 


Asn 


Tyr 


inr 


Ulu 


Tyr 


viin 


Fro Arg 


A7a 1 

vai 


jjcu Aia vai 


lie wiy iyro 




130 










135 






140 




niS 


Ser 


Ser 


VilU 


Leu 


Til a 

Aia 


Leu 


lie inr 


oiy 


jjys Fne jpne 


oer Fne Fne 


145 










i c a 
150 








ICC 

155 


1 C A 

loO 


Leu 


wet 


Fro 




vai 


Ser 


Tyr 


oer Aia 


Ser 


Mec Asp Arg 


Leu Ser Asp 










n ^c 
lOD 








1 / u 




1 *7C 

175 


Arg 


vjJLU 


inr 


File 


Pro 


Ser 


XT HQ 


jrnc Arg 


i hi 


X7a T Dr^x C Ar 

Val irro ocr 


Asp Arg vai 








lo U 








1 QC 

lob 






iy u 


tjin 


Leu 


nl n 


al a 
Ala 


vai 


X7a 1 
Val 


inr 


JJCU JJCU 


Lain 


Aon O V\ a O a v 

ASU irGc ocr 


Www* A en fp v" 

irp Asn irp 
















Zuu 




O A C 

205 




If** 1 

vai 


A 1 « 


Al a 
Ala 


Leu 


uiy 


Ser 


Asp 


ASp ASp 


Tyr 


vyiy Arg uiu 


biy jjcu oer 




1 1 A 
210 










O 1 c 

215 






220 




Tl O 




Ser 


Gl \r 


Leu 


Al a 

in. j. a 


Asn 


Car A r*rr 
OCX ax y 


vjtx y 


Tl o f^va T ~\ o 
xxc Ljro XXC 


A 1 a Wt a Gl ti 
Axa m a uxu 


Z Z 0 


















9 1 C. 
ZOO 


o a n 

Z**\) 


vfiy 


Leu 


Vdl 


Pro 


gi n 


n± a 


A on 
""It 


T^riv Cor 
U1X OCX 


wxy 


Gl ti Gl n T.on 
wXll UlU JJCU 


Gl v T ,\/o T7a 1 

oiy uyo vax 


















ZDU 




c; c 
ZOO 


Vol 


A ays 


V ct J. 


Leu 




will 


Val 
vax 


Ann 1 n 
no ll o X 11 


OCX 


t,vo Va 1 Gl n 
uyo vax Vjiu 


Va 1 Va 1 Va 1 
vax vax vax 








ZoU 








2 DO 






0*7 A 
2 /U 


Leu 


Fne 


Aia 


Ser 


A.1 a 
Hid 


Arg 


Aia 


vai ryr 


Ser 


T.on Dno Oor 

Jjcu Jriic ocr 


iyr oer lie 






275 










*■> a a 
280 




285 




Leu 


His 


Asp 


Leu 


Ser 


Pro 


Lys 


Val Trp 


Val 


Ala Ser Glu 


Ser Trp Leu 




290 










295 






300 




Thr 


Ser 


Asp 


Leu 


Val 


Met 


Thr 


Leu Pro 


Asn 


He Ala Arg 


Val Gly Thr 


305 










310 








315 


320 


Val 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Gly Ala 


Leu 


Leu Pro Glu 


Phe Ser His 










325 








330 




335 


Tyr 


Val 


Glu 


Thr 


Arg 


Leu 


Ala 


Leu Ala 


Ala 


Asp Pro Thr 


Phe Cys Ala 








340 








345 






350 


Ser 


Leu 


Lys 


Ala 


Glu 


Leu 


Asp 


Leu Glu 


Glu 


Arg Val Met 


Gly Pro Arg 






355 










360 




365 




Cys 


Ser 


Gin 


Cys 


Asp 


Tyr 


He 


Met Leu 


Gin 


Asn Leu Ser 


Ser Gly Leu 




370 










375 






380 




Met 


Gin 


Asn 


Leu 


Ser 


Ala 


Gly 


Gin Leu 


His 


His Gin He 


Phe Ala Thr 


385 










390 








395 


400 
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Ala 


Ala 


Val 


Tvr 
405 


Ser 


Val 


Ala 


^■j ° 


Asn 


Val 


Ser 
420 


His 


Cys 


His 


Thr 


Leu 


Leu 


Glu 
435 


Asn 


Met 


Tvr 

xy x 


Asn 


Met 
440 


Leu 


Gin 
450 


Phe 


Asp 


Ala 


Lvs 


Glv 
455 


Ser 


Met 


Tiro 


Val 


Trb 


Gin 


Ser 


Pro 


Thr 


465 










470 






Phe 


Asn 


Glv 


Thr 


Leu 
485 


Gin 


Leu. 


Gin 


Asn 


Gin 


Val 


Pro 
500 


Val 


Ser 


Gin 


Cvs 


Val 




«xy 
515 


Val 


Lys 


Glv 


Phe 


His 
520 


Cys 


Lys 
530 


Ala 


Glv 


Ser 


Tvr 


Ara 
535 


Lvs 


Pro 


Cys 


Glv 


Lys 


Asp 


Gin. 


Tiro 


Ser 


545 










550 






Pro 


Ara 


Ara 


Pro 


Lys 
565 


Phe 


Leu 


Glu 


iicu 


T. ah 




Tien 

580 


Leu 


Ser 


Leu 


Ala 




Leu 


595 


Val 


I1X O 


XIX o 


Ary 


Hop 

600 


« xy 


Pro 
610 


Leu 


AT a 




Phe 
rue 


Glv 
615 


Leu 


Ser 


V ci X 


Leu 


UC Li 


Phe 


Jr X \J 


Gly 


Gin 


625 










630 






m n 

VJXU 


Gin 


Pro 


Leu 


Ser 
645 


His 


Leu 


Pro 


Phe 


Leu 


Gin 


Ala 
660 


Ala 


Glu 


He 


Phe 


irp 


Ala 

/VX Ct 


Ann 
675 




Leu 


Ser 


Glv 

vjxy 


Cys 
680 


V dJL 


Val 
690 


Leu 


Leu 


Ala 


Met 


Leu 
695 


Val 


Leu 


Val 


Ala 


Phe 


Pro 


Pro 


Glu 


Val 


705 










710 






Thr 


Glu 


Ala 


Leu 


Val 
725 


His 


Cvs 


Ara 


Leu 


Ala 


His 


Ala 
740 


Thr 


Asn 


Ala 


Thr 


Thy 
1.111. 


Phe 

JTilC 


Leu 
755 


Val 


Ar*a 
*vxy 


Ser 


Gin 


Pro 
760 




Th-r 

XllX 

770 




Ala 


Met 


Leu 


Ala 
775 


Tvr 
xy x 




Leu 




AT a 




Val 

V CLX 


Gl n 

will 


Val 


785 










790 






Gly 


Ala 


Leu 


Leu 


Leu 
805 


Cys 


Val 


Leu 


Pro 


Arg 


Cys 


Tyr 
820 


Leu 


Leu 


Met 


Arg 


Phe 


Phe 


Leu 
835 


Gly 


Gly 


Gly 


Pro 


Gly 
840 


Asri 


Thr 


Gly 


Asn 


Gin 


Gly 


Lys 


His 



Gin 


Ala 


Leu 


His Asn 


Thr 


Leu 


Gin 




410 








415 




Ser 


Glu 


Pro 


Val Gin 


Pro 


Trp 


Gin 


425 








430 






Ser 


Phe 


Arg 


Ala Arg 


Asp 


Leu 


Thr 








445 








Val 


Asp 


Met 


Glu Tyr 


Asp 


Leu 


Lys 








460 








Pro 


Val 


Leu 


His Thr 


Val 


Gly 


Thr 






475 








480 


His 


Ser 


Lys 


Met Tyr 


Trp 


Pro 


Gly 

2 




490 








495 




Ser 


Arg 


Gin 


Cys Lys 


Asp 


Gly 


Gin 


505 








510 






Ser 


Cys 


Cys 


Tyr Asp 


Cys 


Val 


Asp 








525 








His 


Pro 


Asp 


Asp Phe 


Thr 


Cys 


Thr 








540 








Pro 


Glu 


Lys 


Ser Thr 


Thr 


Cys 


Leu 






555 








560 


Trp 


Gly 


Glu 


Pro Ala 


Val 


Leu 


Leu 




570 








575 




Leu 


Glv 


Leu 


Val Leu 


Ala 


Ala 


Leu 


585 








590 






Ser 


Pro 


Leu 


Val Gin 


Ala 


Ser 


Glv 








605 








Val 


Cvs 


Leu 


Gly Leu 


Val 


Cvs 


Leu 








620 








Pro 


Ser 


Pro 


Ala Arg 


Cys 


Leu 


Ala 






635 








640 


Leu 


Thr 


Glv 


Cys Leu 


Ser 


Thr 


Leu 




650 








655 




Val 


Glu 


Ser 


Glu Leu 


Pro 


Leu 


Ser 


665 








670 






Leu 


Ara 


Gly 


Pro Trp 


Ala 


Tiro 


Leu 








685 








Glu 


Val 


Ala 


Leu Cys 


Thr 


Trp 


Tyr 








700 








Val 


Thr 


Asp 


Trp His 


Met 


Leu 


Pro 






715 








720 


Thr 


Arg 


Ser 


Trp Val 


Ser 


Phe 


Gly 




730 








735 




Leu 


Ala 


Phe 


Leu Cys 


Phe 


Leu 


Gly 


745 








750 






Glv 


Ara 


Tvr 


Asn Arg 


Ala 


Ara 


Glv 








765 








Phe 


He 


Thr 


Trp Val 


Ser 


Phe 


Val 








780 








Val 


Leu 


Arg 


Pro Ala 


Val 


Gin 


Met 






795 








800 


Gly 


He 


Leu 


Ala Ala 


Phe 


His 


Leu 




810 








815 




Gin 


Pro 


Gly 


Leu Asn 


Thr 


Pro 


Glu 


825 








830 






Asp 


Ala 


Gin 


Gly Gin 


Asn 


Asp 


Gly 



845 



Glu 
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850 855 

<210> 26 

<211> 840 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 26 




















Met 


Gly 


Pro 


Arg 


Ala 


Lys 


Thr 


He Cys 


Ser Leu 


Phe Phe 


Leu Leu Trp 


1 








5 










10 




15 


Val 


Leu 


Ala 


Glu 


Pro 


Ala 


Glu 


Asn 


Ser 


Asp Phe 


Tyr Leu 


Pro Gly Asp 








20 










25 






30 


Tyr 


Leu 


Leu 


Gly 


Gly 


Leu 


Phe 


Ser 


Leu 


His Ala 


Asn Met 


Lys Gly He 






35 










40 






45 




Val 


His 


Leu 


Asn 


Phe 


Leu 


Gin 


Val 


Pro 


Met Cys 


Lys Glu 


Tyr Glu Val 




50 










55 








60 




Lys 


Val 


He 


Gly 


Tyr 


Asn 


Leu 


Met 


Gin 


Ala Met 


Arg Phe 


Ala Val Glu 


65 










70 








75 




80 


Glu 


lie 


Asn 


Asn 


Asp 


Ser 


Ser 


Leu 


Leu 


Pro Gly 


Val Leu 


Leu Gly Tyr 










85 










90 




95 


Glu 


lie 


Val 


Asp 


Val 


Cys 


Tyr 


He 


Ser 


Asn Asn 


Val Gin 


Pro Val Leu 








100 










105 






110 


Tyr 


Phe 


Leu 


Ala 


His 


Glu 


Asp 


Asn 


Leu 


Leu Pro 


He Gin 


Glu Asp Tyr 






115 










120 






125 




Ser 


Asn 


Tyr 


He 


Ser 


Arg 


Val 


Val 


Ala 


Val He 


Gly Pro 


Asp Asn Ser 




130 










135 








140 




Glu 


Ser 


Val 


Met 


Thr 


Val 


Ala 


Asn 


Phe 


Leu Ser 


Leu Phe 


Leu Leu Pro 


145 










150 








155 




160 


Gin 


lie 


Thr 


Tyr 


Ser 


Ala 


He 


Ser Asp 


Glu Leu 


Arg Asp 


Lys Val Arg 










165 










170 




175 


Phe 


Pro 


Ala 


Leu 


Leu 


Arg 


Thr 


Thr 


Pro 


Ser Ala 


Asp His 


His Val Glu 








180 










185 






190 


Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 


Phe 


Arg Trp 


Asn Trp 


He He Val 
















200 






205 




Leu 


Val 


Ser 


Ser 


Asp 


Thr 


Tyr 


Gly Arg 


Asp Asn 


Gly Gin 


Leu Leu Gly 




210 










215 








220 




Glu 


Arg 


Val 


Ala 


Arg 


Arg 


Asp 


He Cys 


He Ala 


Phe Gin 


Glu Thr Leu 


225 










230 








235 




240 


Pro 


Thr 


Leu 


Gin 


Pro 


Asn 


Gin 


Asn 


Met 


Thr Ser 


Glu Glu 


Arg Gin Arg 










245 










250 




255 


Leu 


Val 


Thr 


He 


Val 


Asp 


Lys 


Leu 


Gin 


Gin Ser 


Thr Ala 


Arg Val Val 








260 










265 






270 


Val 


Val 


Phe 


Ser 


Pro 


Asp 


Leu 


Thr 


Leu 


Tyr His 


Phe Phe 


Asn Glu Val 






275 










280 






285 




Leu 


Arg 


Gin 


Asn 


Phe 


Thr 


Gly 


Ala 


Val 


Trp He 


Ala Ser 


Glu Ser Trp 




290 










295 








300 




Ala 


He 


Asp 


Pro 


Val 


Leu 


His 


Asn 


Leu 


Thr Glu 


Leu Gly 


His Leu Gly 


305 










310 








315 




320 


Thr 


Phe 


Leu 


Gly 


He 


Thr 


He 


Gin 


Ser 


Val Pro 


He Pro 


Gly Phe Ser 










325 










330 




335 


Glu 


Phe 


Arg 


Glu 


Trp 


Gly 


Pro 


Gin Ala 


Gly Pro 


Pro Pro 


Leu Ser Arg 








340 










345 






350 


Thr 


Ser 


Gin 


Ser 


Tyr 


Thr 


Cys 


Asn 


Gin 


Glu Cys 


Asp Asn 


Cys Leu Asn 
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\ 55 

DDD 










360 




ft! a 


inr 


Leu 


Cor 
OCX 


flic 


A a ti 
Ad LI 


Thr 

111X 


He 

xxc 


Leu 




J / u 










175 

DID 






Wal 


Tyr 


Ser 


Val 


Tyr 


Ser 


nld 


V ctx 


lyr 


JO J 










390 

J J w 








Ser 


Leu 


Leu 


Gly 


Cys 


Asp 


Lys 


Ser 


Thr 










405 










f xo 


Trp 


OJ.Il 


ticu 


LIC U. 


Glu 

Ui. u. 


Glu 


He 


Tro 

xxp 








AO ft 










425 


Asp 


nxS 


yin 


lie 


Dhe» 
flic 


flit; 




ri VJ 


Gin 






435 










440 




Glu 


He 


Val 


Gin 


Trp 


Gin 


Trp 


Asp 


Arg 




A t? O 










*a «3 -J 






Val 




Ser 


Tyr 


Tyr 


• 

Pro 




m n 




4oo 










Am 
% iU 








Tl a. 

lie 


Coy* 


Trp 


nlB 




VQl 


Abu 


Asn 


Thr 










ABC 










Lys 


TV -v~/-» 

Arg 


Cys 


/TlT vi 
ulu 


Ser 


fill ir 


vim 


Lys 


Lys 








cnn 
DU u 










5 05 


Cys 


Cys 


pne 


G1U 


Cys 


Tie 
11c 


Asp 


Cys 


Leu 






513 










con 




Thr 


G1U 


Asp 


G1U 


Tyr 


lilU 


Cys 


UlU 


al a 

Hid 




C ^ A 

b J U 
















Tyr 


villi 


O A V 

&er 


rtT ii 

UlU 


inr 


Ser 


Cys 


flic 


Lys 


CAR 
04kO 










c C n 








Leu 


Arg 


ulU 


IllS 


inr 


Ser 


Trp 


Va 1 
Veil 


Leu 










OOO 










Leu 


Leu 


Leu 


Leu 


Leu 


Giy 


T>Viv- 

inr 


711 a 

Aia 


vjiy 








con 










OOO 


Thr 


Pro 


vai 


val 


Arg 


Ser 


Til a 

Aia 


flT i/ 

Giy 


Giy 






C Q C 










DUU 




Gly 


Ser 


Leu 


Ala 


TV "I — 

Ala 


Giy 


Ser 


/Til ir 

Giy 


Ser 




610 










bio 






Pro 


Thr 


TV ■*-/-» 

Arg 


pro 


Aia 


Cys 


Leu 


Leu 


Arg 


o2o 


















Pne 


Thr 


lie 


pne 


Leu 


Ser 


Cys 


Leu 


xnr 










GA 5 
Ori O 










lie 


He 


Pne 


Lys 


pne 


ser 


inr 


Lys 


17a T 

vai 








ooU 










OOO 


Val 


Gin 


Asn 


His 


Gly 


Aia 


Giy 


Leu 


pne 






CI c 
O ID 










can 

oow 




Gin 


Leu 


Leu 


lie 


Cys 


Leu 


inr 


Trp 


LIC LI 




Oi? U 










O J70 






TV "1 „ 

Ala 


TV v/-t 

Arg 


blU 


Tyr 


r-1 Ti 

urn 


Arg 


pne 


Pro 


TTJ „ 

nib 


•7 t\ C 

705 










/ 1U 








G1U 


mr 


Asn 


Ser 


Leu 


vaiy 


flic 


lie 


XJCUL 










725 










Leu 


Ser 


He 


Ser 


Ala 


Phe 


Ala 


Cys 


Ser 








740 










745 


Glu 


Asn 


Tyr 


Asn 


Glu 


Ala 


Lys 


Cys 


Val 






755 










760 




Phe 


Val 


Ser 


Trp 


He 


Ala 


Phe 


Phe 


Thr 




770 










775 






Lys 


Tyr 


Leu 


Pro 


Ala 


Ala 


Asn 


Met 


Met 
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365 



Arcr 


Leu 


Ser 


Gly 


Glu 


Arg 


Val 






380 










Ala 


Val 


Ala 


His 


Ala 


Leu 


His 




395 










400 


Cys 


Thr 


Lys 


Arg 


Val 


Val 


Tyr 


410 










415 




Lys 


Val 


Asn 


Phe 


Thr 


Leu 


Leu 










430 






Gly 


Asp 


Val 


Ala 


Leu 


His 


Leu 








445 








Ser 


Gin 


Asn 


Pro 


Phe 


Gin 


Ser 






** o w 










will 


ucu 


Xi jr O 


ion 


Tie 
xxc 


Gin 


Asp 




475 










480 


Tie 

X 1C 


Pro 

JTX \J 


Met 

1*1 C l» 


Ser 


Met 


Cys 


Ser 












495 




T.vo 


Pro 


Val 


Glv 
oxy 


He 


His 


Val 










510 






Pro 


Glv 


Thr 


Phe 


Leu 


Asn 


His 








525 








Cys 


Pro 


Asn 


Asn 


Glu 


Tro 


Ser 






540 










Aro 


Gin 


Leu 


Val 


Phe 


Leu 


Glu 




555 










560 


Leu 


Ala 


Ala 


Asn 


Thr 


Leu 


Leu 


O f u 










DID 




Leu 


Phe 


Ala 


LL V 


His 


Leu 


Asp 










D -7 V 








Leu 


Cys 


Phe 


Leu 


Met 


Leu 








o WO 








ucu 




uiy 


Phe 

fllC 


Phe 
rue 


Glv 
oxy 


Glu 






D <6 U 










vjlll 


Til a 
Ala 


UCU 


fllC 


AT a 


ucu 


Glv 
v*iy 














640 


Va 1 
v cii 


A rn 

Hxg 


OCL 


Phe 
rue 


VJXll 


uc u 


He 
xxc 


0 OVJ 










655 
OOO 




Urn 


X11X 


Phe 

rue 


TVr 

xyir 


His 


Ala 


lip 
















Val 


Met 


He 


Ser 


Ser 


Ala 


Ala 
















Val 


Val 




Thr 


Pro 


Leu 


Pro 






700 










Leu 


Val 


Met 


Leu 


Glu 


Cvs 


Thr 




715 










720 


Ala 


Phe 


Leu 


±yx 


Asn 


Glv 


Leu 


/Ju 










715 
/ J o 




Tyr 


Leu 


Gly 


Lys 


Asp 


Leu 


Pro 










750 






Thr 


Phe 


Ser 


Leu 


Leu 


Phe 


Asn 








765 








Thr 


Ala 


Ser 


Val 


Tyr 


Asp 


Gly 






780 










Ala 


Gly 


Leu 


Ser 


Ser 


Leu 


Ser 




795 










800 
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Ser Gly Phe Gly Gly Tyr Phe Leu Pro Lys Cys Tyr Val lie Leu Cys 

805 810 815 

Arg Pro Asp Leu Asn Ser Thr Glu His Phe Gin Ala Ser He Gin Asp 

820 825 830 

Tyr Thr Arg Arg Cys Gly Ser Thr 
835 840 

<210> 27 
<211> 840 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 

<400> 27 

Met Leu Leu Cys Thr Ala Arg Leu Val Gly Leu Gin Leu Leu He Ser 

1 5 10 15 

Cys Cys Trp Ala Phe Ala Cys His Ser Thr Glu Ser Ser Pro Asp Phe 

20 25 30 

Thr Leu Pro Gly Asp Tyr Leu Leu Ala Gly Leu Phe Pro Leu His Ser 

35 40 45 

Gly Cys Leu Gin Val Arg His Arg Pro Glu Val Thr Leu Cys Asp Arg 

50 55 60 

Ser Cys Ser Phe Asn Glu His Gly Tyr His Leu Phe Gin Ala Met Arg 
65 70 75 80 

Leu Gly Val Glu Glu He Asn Asn Ser Thr Ala - Leu Leu Pro Asn He 

85 90 95 

Thr Leu Gly Tyr Gin Leu Tyr Asp Val Cys Ser Asp Ser Ala Asn Val 

100 105 110 

Tyr Ala Thr Leu Arg Val Leu Ser Leu Pro Gly Gin His His He Glu 

115 120 125 

Leu Gin Gly Asp Leu Leu His Tyr Ser Pro Thr Val Leu Ala Val He 

130 135 140 

Gly Pro Asp Ser Thr Asn Arg Ala Ala Thr Thr Ala Ala Leu Leu Ser 
145 150 . 155 160 

Pro Phe Leu Val Pro Met He Ser. Tyr Ala Ala Ser Ser Glu Thr Leu 

165 170 175 

Ser Val Lys Arg Gin Tyr Pro Ser Phe Leu Arg Thr He Pro Asn Asp 

180 185 190 

Lys Tyr Gin Val Glu Thr Met Val Leu Leu Leu Gin Lys Phe Gly Trp 

195 200 205 

Thr Trp He Ser Leu Val Gly Ser Ser Asp Asp Tyr Gly Gin Leu Gly 

210 215 220 

Val Gin Ala Leu Glu Asn Gin Ala Thr Gly Gin Gly He Cys He Ala 
225 230 235 240 

Phe Lys Asp He Met Pro Phe Ser Ala Gin Val Gly Asp Glu Arg Met 

245 250 255 

Gin Cys Leu Met Arg His Leu Ala Gin Ala Gly Ala Thr Val Val Val 

260 265 270 

Val Phe Ser Ser Arg Gin Leu Ala Arg Val Phe Phe Glu Ser Val Val 

275 280 285 

Leu Thr Asn Leu Thr Gly Lys Val Trp Val Ala Ser Glu Ala Trp Ala 

290 295 300 

Leu Ser Arg His He Thr Gly Val Pro Gly He Gin Arg He Gly Met 
305 310 315 320 
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Val 


Leu 


Gly 


Val 


Ala 
325 


He 


Gin 


Lys 


Arg 


Ala 
330 


Val 


Pro 


Gly 


Leu Lys Ala 
335 


Phe 


Glu 


Glu 


Ala 
340 


Tvr 


Ala 


Ara 


Ala 


Asp 
345 


Lvs 


Lvs 


Ala 


Pro 


Arg Pro Cys 
350 


His 


Lys 


Glv 
355 


Ser 


Trrj 

xxp 


Cvs 


Ser 


Ser 
360 


Asn 


Gin 


Leu 


Cys 


Ara 
365 


Glu Cvs Gin 


Ala 


Phe 
370 


Met 
ric L* 


Ala 


His 


Thr 


Met 
375 


Pro 


Lvs 


Leu 


Lvs 


Ala 

380 


Phe 


Ser Met Ser 


Ser 


Ala 


xyr 


Asn 


Ala 


Tvr 


Ara 


Ala 


Val 


Tvr 


Ala 


Val 


Ala 


His Glv Leu 


385 










390 










395 






400 


His 


Gin 


Leu 


Leu 


Glv 

vjiy 
405 


Cys 


Ala 


Ser 


Gly 


Ala 
410 


Cvs 


Ser 


Ara 


Gly Arg Val 
415 


Tvr 


Pro 


Tm 


Gin 
420 


Leu 


Leu 


Glu 


Gin 


He 
425 


His 


Lvs 


Val 


His 


Phe Leu Leu 
430 


His 


Lys 


Asp 
435 


Thr 


Val 


Ala 


Phe 


Asn 
440 




Asn 


Ara 


Asp 


Pro 
445 


Leu Ser Ser 


Tvr 


Asn 
450 


lie 


lie 


Ala 


Tm 


Asp 
455 


Tm 


Asn 


Glv 


Pro 


Lys 
460 


Tro 


Thr Phe Thr 


Val 


Leu 


Glv 


Ser 


Ser 


Thr 


Tm 


Ser 


Pro 


Val 


Gin 


Leu 


Asn 


Tie Asn Glu 


465 










470 










475 






480 


Thr 

111X 




Tie 

X 1C 


Gin 


Tm 

485 


His 


Glv 
v*xy 


Lys 


Asp 


Asn 
490 


Gin 


Val 


Pro 


TiVS Ser Val 
ujr o ocx v ax 

495 




OCX 


Cav> 
OCX 


500 


. Pvci 

JT 


Tien 


Glu 


Glv 
oxy 


His 
505 


Gin 


Ara 
Axg 


Val 

val 


Val 

v ax 


Thr Gl v Phe 
nix wxy riic 

510 


His 


His 


»— y o 

515 


^ jr o 


Phe 
it lie; 


Glu 


^»y o 


Val 
520 


Pro 


Cys 


Glv 


Ala 


Glv 
vjx y 

525 


Thr" Ph*a TtOii 

1111 lr 11C UCU 


7\ c ti 
noli 


530 


Gar 
QCI 


Asp 


Leu 


Tvr 
lyx 


Ara 

535 


Cys 


Gin 


Pro 


Cys 


Glv 
540 


Lys 


Glu Glu Trn 

win WlU 11U 


nla 


Pro 


Gl 11 
UJ.U 


gi v 

vjxy 


Cav> 
OC1 


Gl Tl 
V3X11 


Thr 




Phe 


Pro 


Ara 
Axg 


Thr 


Val 
v ax 


Val D"h<a Tien 
val iriic lie IX 


545 










550 










555 






560 


gi ii 




His 
mo 


ox u 


Ala 
565 


Prn 


Thr 


He 


Ala 


Val 
570 


Ala 


Leu 


Leu 


Ala Ala lieu 
ai c* ma ucu 

575 


Glv 




Leu 


Ser 
580 


Thr 


Leu 


Ala 


lie 


Leu 
585 


Val 


He 


Phe 


Tm 

ixp 


Ara His Phe 
590 


Gin 


Thr 


Pro 
595 


He 


Val 


Ara 
«xy 


Ser 


Ala 
600 


Glv 

VJX J 


Glv 


Pro 


Met 


Cys 
605 


Phe Leu Met 


Leu 


Thr 
610 


Leu 


Leu 


Leu 


Val 


Ala 
615 


Tvr 


Met 


Val 


Val 


Pro 
620 


Val 


Tyr Val Gly 


Pro 


Pro 


Lys 


Val 


Ser 


Thr 


Cvs 


Leu 


Cys 


Ara 


Gin 


Ala 


Leu 


Phe Pro Leu 


625 










630 










635 






640 


V-y a 


Phe 


Thr 

1111 


Tie 

11C 


Cys 
645 


He 


Ser 


Cys 


He 


Ala 
650 


Val 


Ara 
Axg 


Ser 


Phe Gin Tie 

JT lis* VJ 111 

655 


Val 
val 


ijys 


Al a 


Phe 
660 


u y o 


Met- 
ric u 


Ala 


Ser 


Ara 
665 


Phe 


Pro 


Ara 
Arg 


Ala 

Axa 


Tvr Ser Tvr 
lyx ocx lyi 

670 


Trp 


val 


Arg 
675 


iyr 


Gin 

13111 


Gl v 
oiy 


JrX U 


iyr 
680 


val 


OCX 


Met 

1*1 c u 


Al a 


Phe 
685 


Tie Thr Val 
xxc i in val 


XICLL 


Lys 
690 


Met 


Val 

V ax 


He 


Val 


Val 
695 


He 


Glv 


Met 


Leu 


Ala 
700 


Thr 


Gl v Leu Ser 

vjijr ucu oci 


■c 1 tj 


1111 


Thr 

X11X 


Ara 


Thr 


Asp 


Pro 


Asp 


Asn 


Pro 


Lys 


lie 


Thr 


Tie Val Ser 

lie vai aei 


705 










710 










715 






720 


Cys 


Asn 


Pro 


Asn 


Tyr 
725 


Arg 


Asn 


Ser 


Leu 


Leu 
730 


Phe 


Asn 


Thr 


Ser Leu Asp 
735 


Leu 


Leu 


Leu 


Ser 
740 


Val 


Val 


Gly 


Phe 


Ser 
745 


Phe 


Ala 


Tyr 


Met 


Gly Lys Glu 
750 


Leu 


Pro 


Thr 
755 


Asn 


Tyr 


Asn 


Glu 


Ala 
760 


Lys 


Phe 


He 


Thr 


Leu 
765 


Ser Met Thr 


Phe 


Tyr 


Phe 


Thr 


Ser 


Ser 


Val 


Ser 


Leu 


Cys 


Thr 


Phe 


Met 


Ser Ala Tyr 



41 



WO 2005/015158 



PCT/US2004/025459 



770 



775 



780 



Ser Gly Val Leu Val Thr lie Val Asp Leu Leu Val Thr Val Leu Asn 
785 790 795 800 

Leu Leu Ala lie Ser Leu Gly Tyr Phe Gly Pro Lys Cys Tyr Met lie 

805 810 815 

Leu Phe Tyr Pro Glu Arg Asn Thr Pro Ala Tyr Phe Asn Ser Met lie 

820 825 830 

Gin Gly Tyr Thr Met Arg Arg Asp 



<210> 28 
<211> 1123 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 28 

Leu Gin Val Arg His Arg Pro Glu Val Thr Leu Cys Asp Arg Ser Cys 

1 5 10 15 

Ser Phe Asn Glu His Gly Tyr His Leu Phe Gin Ala Met Arg Leu Gly 

20 25 30 

Val Glu Glu lie Asn Asn Ser Thr Ala Leu Leu Pro Asn lie Thr Leu 

35 40 45 

Gly Tyr Gin Leu Tyr Asp Val Cys Ser Asp Ser Ala Asn Val Tyr Ala 

50 55 60 

Thr Leu Arg Val Leu Ser Leu Pro Gly Gin His His lie Glu Leu Gin 
65 70 75 80 

Gly Asp Leu Leu His Tyr Ser Pro Thr Val Leu Ala Val lie Gly Pro 

85 90 95 

Asp Ser Thr Asn Arg Ala Ala Thr Thr Ala Ala Leu Leu Ser Pro Phe 

100 105 110 

Leu Val Pro Met lie Ser Tyr Ala Ala Ser Ser Glu Thr Leu Ser Val 

115 120 125 

Lys Arg Gin Tyr Pro Ser Phe Leu Arg Thr lie Pro Asn Asp Lys Tyr 

130 135 140 

Gin Val Glu Thr Met Val Leu Leu Leu Gin Lys Phe Gly Trp Thr Trp 
145 150 155 160 

lie Ser Leu Val Gly Ser Ser Asp Asp Tyr Gly Gin Leu Gly Val Gin 

165 170 175 

Ala Leu Glu Asn Gin Ala Thr Gly Gin Gly lie Cys lie Ala Phe Lys 

180 185 190 

Asp lie Met Pro Phe Ser Ala Gin Val Gly Asp Glu Arg Met Gin Cys 

195 200 205 

Leu Met Arg His Leu Ala Gin Ala Gly Ala Thr Val Val Val Val Phe 

210 215 220 

Ser Ser Arg Gin Leu Ala Arg Val Phe Phe Glu Ser Val Val Leu Thr 
225 230 235 240 

Asn Leu Thr Gly Lys Val Trp Val Ala Ser Glu Ala Trp Ala Leu Ser 

245 250 255 

Arg His He Thr Gly Val Pro Gly He Gin Arg He Gly Met Val Leu 

260 265 270 

Gly Val Ala He Gin Lys Arg Ala Val Pro Gly Leu Lys Ala Phe Glu 

275 280 285 

Glu Ala Tyr Ala Arg Ala Asp Lys Lys Ala Pro Arg Pro Cys His Lys 



835 



840 
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290 295 300 

Gly Ser Trp Cys Ser Ser Asn Gin Leu Cys Arg Glu Cys Gin Ala Phe 
305 310 315 320 

Met Ala His Thr Met Pro Lys Leu Lys Ala Phe Ser Met Ser Ser Ala 

325 ~ 330 335 

Tyr Asn Ala Tyr Arg Ala Val Tyr Ala Val Ala His Gly Leu His Gin 

340 345 350 

Leu Leu Gly Cys Ala Ser Gly Ala Cys Ser Arg Gly Arg Val Tyr Pro 

355 360 365 

Trp Gin Leu Leu Glu Gin lie His Lys Val His Phe Leu Leu His Lys 

370 375 380 

Asp Thr Val Ala Phe Asn Asp Asn Arg Asp Pro Leu Ser Ser Tyr Asn 
385 39.0 395 400 

lie lie Ala Trp Asp Trp Asn Gly Pro Lys Trp Thr Phe Thr Val Leu 

405 410 415 

Gly Ser Ser Thr Trp Ser Pro Val Gin Leu Asn lie Asn Glu Thr Lys. 

420 425 430 

lie Gin Trp His Gly Lys Asp Asn Gin Val Pro Lys Ser Val Cys Ser 

435 440 445 

Ser Asp Cys Leu Glu Gly His Gin Arg Val Val Thr Gly Phe His His 

450 455 460 

Cys Cys Phe Glu Cys Val Pro Cys Gly Ala Gly Thr Phe Leu Asn Lys 
465 470 475 480 

Ser Asp Leu Tyr Arg Cys Gin Pro Cys Gly Lys Glu Glu Trp Ala Pro 

485 490 495 

Glu Gly Ser Gin Thr Cys Phe Pro Arg Thr Val Val Phe Leu Glu Trp 

500 505 510 

Ser Asp lie Glu Ser lie lie Ala He Ala Phe Ser Cys Leu Gly He 

515 520 525 

Leu Val Thr Leu Phe Val Thr Leu He Phe Val Leu Tyr Arg Asp Thr 

530 535 540 

Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr He He Leu Ala 
545 550 555 560 

Gly He Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu He Ala Lys Pro 

565 570 575 

Thr Thr Thr Ser Cys Tyr Leu Gin Arg Leu Leu Val Gly Leu Ser Ser 

580 585 590 

Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg He Ala Arg 

595 600 605 

He Leu Ala Gly Ser Lys Lys Lys He Cys Thr Arg Lys Pro Arg Phe 

610 615 620 

Met Ser Ala Trp Ala Gin Val He He Ala Ser He Leu He Ser Val 
625 630 635 640 

Gin Leu Thr Leu Val Val Thr Leu He He Met Glu Pro Pro Met Pro 

645 650 655 

He Leu Ser Tyr Pro Ser He Lys Glu Val Tyr Leu He Cys Asn Thr 

660 665 670 

Ser Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu He 

675 680 685 

Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn 

690 695 700 

Phe Asn Glu Ala Lys Tyr lie Ala Phe Thr Met Tyr Thr Thr Cys lie 
705 710 715 " 720 

He Trp Leu Ala Phe Val Pro He Tyr Phe Gly Ser Asn Tyr Lys He 

725 730 735 

He Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly 
740 745 750 
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Cys Met Phe Thr Pro Lys Met Tyr lie lie lie Ala Lys Pro Glu Arg 

755 760 765 

Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His. Val 

770 775 780 

Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn lie Phe 
785 790 795 800 

Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys Ser 

805 810 815 

Val Ser Trp Ser Glu Pro Gly Gly Arg Gin Ala Pro Lys Gly Gin His 

820 825 830 

Val Trp Gin Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala Cys 

835 840 845 

Asn Gin Thr Ala Val lie Lys Pro Leu Thr Lys Ser Tyr Gin Gly Ser 

850 855 860 

Gly Lys Ser Leu Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr Asn 
865 870 875 880 

Val Glu Glu Glu Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro Ser 

885 890 895 

Ser Pro Ser Met Val Val His Arg Arg Gly Pro Pro Val Ala Thr Thr 

900 905 910 

Pro Pro Leu Pro Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe Leu 

915 920 925 

Ala Asp Ser Val lie Pro Lys Gly Leu Pro Pro Pro Leu Pro Gin Gin 

930 935 940 

Gin Pro Gin Gin Pro Pro Pro Gin Gin Pro Pro Gin Gin Pro Lys Ser 
945 950 955 960 

Leu Met Asp Gin Leu Gin Gly Val Val Thr Asn Phe Gly Ser Gly He 

965 970 975 

Pro Asp Phe His Ala Val Leu Ala Gly Pro Gly Thr Pro Gly Asn Ser 

980 985 990 

Leu Arg Ser Leu Tyr Pro Pro Pro Pro Pro Pro Gin His Leu Gin Met 

995 1000 1005 

Leu Pro Leu His Leu Ser Thr Phe Gin Glu Glu Ser He Ser Pro Pro 

1010 1015 1020 

Gly Glu Asp He Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gin Glu 
1025 1030 1035 1040 

Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu Glu Glu 

1045 1050 1055 

Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser Pro 

1060 1065 1070 

Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly Ser 

1075 1080 1085 

Ser Val Pro Ser Ser Pro Val Ser Glu Ser Val Leu Cys Thr Pro Pro 

1090 1095 1100 

Asn Val Thr Tyr Ala Ser Val He Leu Arg Asp Tyr Lys Gin Ser Ser 
1105 1110 1115 " 1120 

Ser Thr Leu 



<210> 29 

<211> 1172 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note o 
synthetic construct 
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<400> ,29 

Met Gly Pro Arg Ala Lys Thr lie Cya Ser Leu Phe Phe Leu Leu Trp 

1 5 .10 15 

Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 

20 25 30 

Tyr Leu Leu Gly Gly Leu Phe Ser Leu His Ala Asn Met Lys Gly lie 

35 40 45 

Val His Leu Asn Phe Leu Gin Val Pro Met Cys Lys Glu Tyr Glu Val 

50 55 60 

Lys Val lie Gly Tyr Asn Leu Met Gin Ala Met Arg Phe Ala Val Glu 
65 70 75 80 

Glu lie Asn Asn Asp Ser Ser Leu Leu Pro Gly Val Leu Leu Gly Tyr 

85 90 ■ 95 

Glu lie Val Asp Val Cys Tyr lie Ser Asn Asn Val Gin Pro Val Leu 

100 105 110 

Tyr Phe Leu Ala His Glu Asp Asn Leu Leu Pro lie Gin Glu Asp Tyr 

115 ~ 120 125 

Ser Asn Tyr lie Ser Arg Val Val Ala Val lie Gly Pro Asp Asn Ser 

130 135 140 

Glu Ser Val Met Thr Val Ala Asn Phe Leu Ser Leu Phe Leu Leu Pro 
145 150 155 160 

Gin lie Thr Tyr Ser Ala lie Ser Asp Glu Leu Arg Asp Lys Val Arg 

165 170 175 

Phe Pro Ala Leu Leu Arg Thr Thr Pro Ser Ala Asp His His Val Glu 

180 185 190 

Ala Met . Val Gin Leu Met Leu His Phe Arg Trp Asn Trp lie He Val 

195 200 205 

Leu Val Ser Ser Asp Thr Tyr Gly Arg Asp Asn Gly Gin Leu Leu Gly ' 

210 215 220 

Glu Arg Val Ala Arg Arg Asp He Cys He Ala Phe Gin Glu Thr Leu 
225 230 235 240 

Pro Thr Leu Gin Pro Asn Gin Asn Met Thr Ser Glu Glu Arg Gin Arg 

245 250 255 

Leu Val Thr He Val Asp Lys Leu Gin Gin Ser Thr Ala Arg Val Val 

260 265 270 

Val Val Phe Ser Pro Asp Leu Thr Leu Tyr His Phe Phe Asn Glu Val 

275 280 285 

Leu Arg Gin Asn Phe Thr Gly Ala Val Trp He Ala Ser Glu Ser Trp 

290 295 300 

Ala He Asp Pro Val Leu His Asn Leu Thr Glu Leu Gly His Leu Gly 
305 310 315 " 320 

Thr Phe Leu Gly He Thr He Gin Ser Val Pro He Pro Gly Phe Ser 

325 330 335 

Glu Phe Arg Glu Trp Gly Pro Gin Ala Gly Pro Pro Pro Leu Ser Arg 

340 345 350 

Thr Ser Gin Ser Tyr Thr Cys Asn Gin Glu Cys Asp Asn Cys Leu Asn 

355 360 * 365 

Ala Thr Leu Ser Phe Asn Thr He Leu Arg Leu Ser Gly Glu Arg Val 

370 375 380 

Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala His Ala Leu His 
385 390 395 400 

Ser Leu Leu Gly Cys Asp Lys Ser Thr Cys Thr Lys Arg Val Val Tyr 

405 410 415 

Pro Trp Gin Leu Leu Glu Glu lie Trp Lys Val Asn Phe Thr Leu Leu 

420 425 430 

Asp His Gin He Phe Phe Asp Pro Gin Gly Asp Val Ala Leu His Leu 
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435 440 445 

Glu He Val Gin Trp Gin Trp Asp Arg Ser Gin Asn Pro Phe Gin Ser 

450 455 460 

Val Ala Ser Tyr Tyr Pro Leu Gin Arg Gin Leu Lys Asn He Gin Asp 
465 470 475 480 

He Ser Trp His Thr Val Asn Asn Thr He Pro Met Ser Met Cys Ser 

485 490 495 

Lys Arg Cys Gin Ser Gly Gin Lys Lys Lys Pro Val Gly He His Val 

500 505 510 

Cys Cys Phe Glu Cys He Asp Cys Leu Pro Gly Thr Phe Leu Asn His 

515 520 525 

Thr Glu Asp Glu Tyr Glu Cys Gin Ala Cys Pro Asn Asn Glu Trp Ser 

530 535 540 

Tyr Gin Ser Glu Thr Ser Cys Phe Lys Arg Gin Leu Val Phe Leu Glu 
545 550 555 560 

Trp Ser Asp He Glu Ser lie He Ala He Ala Phe Ser Cys Leu Gly 

565 570 575 

He Leu Val Thr Leu Phe Val Thr Leu He Phe Val Leu Tyr Arg Asp 

580 585 590 

Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr He He Leu 

595 600 605 

Ala Gly He Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu He Ala Lys 

610 615 620 

Pro Thr Thr Thr Ser Cys Tyr Leu Gin Arg Leu Leu Val Gly Leu Ser 
625 630 635 640 

Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg He Ala 

645 650 655 

Arg He Leu Ala Gly Ser Lys Lys Lys He Cys Thr Arg Lys Pro Arg 

660 665 670 

Phe Met Ser Ala Trp Ala Gin Val lie He Ala Ser He Leu He Ser 

675 680 685 

Val Gin Leu Thr Leu Val Val Thr Leu He He Met Glu Pro Pro Met 

690 695 700 

Pro He Leu Ser Tyr Pro Ser He Lys Glu Val Tyr Leu He Cys Asn 
705 710 715 720 

Thr Ser Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu 

725 730 735 

He Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala 

740 745 750 

Asn Phe Asn Glu Ala Lys Tyr He Ala Phe Thr Met Tyr Thr Thr Cys 

755 760 765 

He He Trp Leu Ala Phe Val Pro He Tyr Phe Gly Ser Asn Tyr Lys 

770 775 780 

He He Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu 
785 790 795 800 

Gly Cys Met Phe Thr Pro Lys Met Tyr He He He Ala Lys Pro Glu 

805 810 815 

Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His 

620 825 830 

Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn He 

835 840 845 

Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys 

850 855 860 

Ser Val Ser Trp Ser Glu Pro Gly Gly Arg Gin Ala Pro Lys Gly Gin 
865 870 875 880 

His Val Trp Gin Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala 
885 890 895 
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Cys 


Asn 


Gin 


Thr 
900 


Ala 


Val 


lie 


Lys 


Pro 
905 


Leu 


Thr 


Lys 


Ser 


Tyr 
910 


Gin 


Gly 


Ser 


Gly 


Lys 
915 


Ser 


Leu 


Thr 


Phe 


Ser 
920 


Asp 


Ala 


Ser 


Thr 


Lys 
925 


Thr 


Leu 


Tyr 


Asn 


Val 
930 


Glu 


Glu 


Glu 


Asp 


Asn 
935 


Thr 


Pro 


Ser 


Ala 


His 
940 


Phe 


Ser 


Pro 


Pro 


Ser 


Ser 


Pro 


Ser 


Met 


Val 


Val 


His 


Arg 


Arg 


Gly 


Pro 


Pro 


Val 


Ala 


Thr 


945 










950 










955 










960 


Thr 


Pro 


Pro 


Leu 


Pro 
965 


Pro 


His 


Leu 


Thr 


Ala 
970 


Glu 


Glu 


Thr 


Pro 


Leu 
975 


Phe 


Leu 


Ala 


Asp 


Ser 
980 


Val 


He 


Pro 


Lys 


Gly 
985 


Leu 


Pro 


Pro 


Pro 


Leu 
990 


Pro 


Gin 


Gin 


Gin 


Pro 


Gin 


Gin 


Pro 


Pro 


Pro 


Gin 


Gin 


Pro 


Pro 


Gin 


Gin 


Pro 


Lys 






995 










1000 








1005 






Ser 


Leu 


Met 


Asp 


Gin 


Leu 


Gin 


Gly Val 


Val 


Thr 


Asn 


Phe Gly 


Ser 


Gly 




1010 








1015 








1020 








He 


Pro Asp 


Phe 


His 


Ala 


Val 


Leu 


Ala 


Gly 


Pro 


Gly Thr Pro 


Gly 


Asn 


1025 








1030 








1035 








1040 


Ser Leu Arg 


Ser 


Leu 


Tyr 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Gin 


His 


Leu 


Gin 










1045 








1050 








1055 


Met 


Leu 


Pro 


Leu 


His 


Leu 


Ser 


Thr 


Phe 


Gin 


Glu 


Glu 


Ser 


He 


Ser 


Pro 



1060 1065 1070 

Pro Gly Glu Asp He Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gin 

1075 1080 1085 

Glu Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu Glu 

1090 1095 1100 

Glu Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser 
1105 1110 1115 1120 

Pro Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly 

1125 1130 1135 

Ser Ser Val Pro Ser Ser Pro Val Ser Glu Ser Val Leu Cys Thr Pro 

1140 1145 1150 

Pro Asn Val Thr Tyr Ala Ser Val He Leu Arg Asp Tyr Lys Gin Ser 

1155 1160 1165 

Ser Ser Thr Leu 
1170 

<210> 30 
<211> 1175 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 30 



Met Leu Gly Pro 


Ala 


Val Leu 


Gly 


Leu 


Ser 


Leu 


Trp Ala 


Leu Leu His 


1 


5 








10 






15 


Pro Gly Thr Gly 


Ala 


Pro Leu 


Cys 


Leu 


Ser 


Gin 


Gin Leu 


Arg Met Lys 


20 








25 








30 . 


Gly Asp Tyr Val 


Leu 


Gly Gly 


Leu 


Phe 


Pro 


Leu 


Gly Glu 


Ala Glu Glu 


35 






40 








45 




Ala Gly Leu Arg 


Ser 


Arg Thr 


Arg 


Pro 


Ser 


Ser 


Pro Val 


Cys Thr Arg 


50 




55 










60 




Phe Ser Ser Asn 


Gly 


Leu Leu 


Trp 


Ala 


Leu 


Ala 


Met Lys 


Met Ala Val 


65 




70 








75 




80 
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Glu Glu 


He 


Asn 


Asn 


Lys Ser 


Asp Leu 


Leu 


Pro Gly Leu Arg Leu Gly 








85 






90 


95 


Tyr Asp 


Leu 


Phe 


Asp 


Thr Cys 


Ser Glu 


Pro 


Val Val Ala Met Lys Pro 






100 






105 




110 


Ser Leu 


Met 


Phe 


Leu 


Ala Lys 


Ala Gly 


Ser 


Arg Asp He Ala Ala Tyr 




115 








120 




125 


Cys Asn 


Tyr 


Thr 


Gin 


Tyr Gin 


Pro Arg 


Val 


Leu Ala Val He Gly Pro 


130 








135 






140 


His Ser 


Ser 


Glu 


Leu 


Ala Met 


Val Thr 


Gly 


Lys Phe Phe Ser Phe Phe 


145 








150 






155 160 


Leu Met 


Pro 


Gin 


Val 


Ser Tyr 


Gly Ala 


Ser 


Met Glu Leu Leu Ser Ala 








165 






170 


175 


Arg Glu 


Thr 


Phe 


Pro 


Ser Phe 


Phe Arg 


Thr 


Val Pro Ser Asp Arg Val 






180 






185 




190 


Gin Leu 


Thr 


Ala 


Ala 


Ala Glu 


Leu Leu 


Gin 


Glu Phe Gly Trp Asn Trp 




195 








200 




205 


Val Ala 


Ala 


Leu 


Gly 


Ser Asp 


Asp Glu 


Tyr 


Gly Arg Gin Gly Leu Ser 


210 








215 






220 


He Phe 


Ser 


Ala 


Leu 


Ala Ala 


Ala Arg 


Gly 


He Cys He Ala His Glu 


225 








230 






235 240 


Gly Leu 


Val 


Pro 


Leu 


Pro Arg 


Ala Asp 


Asp 


Ser Arg Leu Gly Lys Val 








245 






250 


255 


Gin Asp 


Val 


Leu 


His 


Gin Val 


Asn Gin 


Ser 


Ser Val Gin Val Val Leu 






260 






265 




270 


Leu Phe 


Ala 


Ser 


Val 


His Ala 


Ala His 


Ala 


Leu Phe Asn Tyr Ser He 




275 








280 




285 


Ser Ser 


Arg 


Leu 


Ser 


Pro Lys 


Val Trp 


Val 


Ala Ser Glu Ala Trp Leu 


290 








295 






300 


Thr Ser 


Asp 


Leu 


Val 


Met Gly 


Leu Pro 


Gly 


Met Ala Gin Met Gly Thr 


305 








310 






315 320 


Val Leu 


Gly 


Phe 


Leu 


Gin Arg 


Gly Ala 


Gin 


Leu His Glu Phe Pro Gin 








325 






330 


335 


Tyr Val 


Lys 


Thr 


His 


Leu Ala 


Leu Ala 


Thr 


Asp Pro Ala Phe Cys Ser 






340 






345 




350 


Ala Leu 


Gly 


Glu 


Arg 


Glu Gin 


Gly Leu 


Glu 


Glu Asp Val Val Gly Gin 




355 








360 




365 


Arg Cys 


Pro 


Gin 


Cys 


Asp Cys 


lie. Thr 


Leu 


Gin Asn Val Ser Ala Gly 


370 








375 






380 


Leu Asn 


His 


His 


Gin 


Thr Phe 


Ser Val 


Tyr 


Ala Ala Val Tyr Ser Val 


385 








390 






395 400 


Ala Gin 


Ala 


Leu 


His 


Asn Thr 


Leu Gin 


Cys 


Asn Ala Ser Gly Cys Pro 








405 






410 


415 


Ala Gin 


Asp 


Pro 


Val 


Lys Pro 


Trp Gin 


Leu 


Leu Glu Asn Met Tyr Asn 






420 






425 




430 


Leu Thr 


Phe 


His 


Val 


Gly Gly 


Leu Pro 


Leu 


Arg Phe Asp Ser Ser Gly 




435 








440 




445 


Asn Val 


Asp 


Met 


Glu 


Tyr Asp 


Leu Lys 


Leu 


Trp Val Trp Gin Gly Ser 


450 








455 






460 


Val Pro 


Arq 


Leu 


His 


Asp Val 


Glv Arg 


Phe 


Asn Gly Ser Leu Arg Thr 


465 








470 






475 480 


Glu Arg 


Leu 


Lys 


He 


Arg Trp 


His Thr 


Ser 


Asp Asn Gin Lys Pro Val 








485 






490 


495 


Ser Arg 


Cys 


Ser 


Arg 


Gin Cys 


Gin Glu 


Gly 


Gin Val Arg Arg Val Lys 






500 






505 




510 


Gly Phe 


His 


Ser 


Cys 


Cys Tyr 


Asp Cys 


Val 


Asp Cys Glu Ala Gly Ser 




515 








520 




525 



48 



WO 2005/015158 



PCT/US2004/025459 



Tvr 


Ara 


Gin 


Asn 


Pro 


Asp 


Asp 

XT 


He 




530 










535 




Glu 




Ser 


Pro 


Glu 


Arcr 


Ser 


Thr 


545 










550 






Phe 


Leu 


Glu 


Trn 


Ser 


Asp 


He 


Glu 










565 








Cvs 

\-y o 


Leu 


Glv 


He 


Leu 


Val 


Thr 


Leu 








580 










Tvr 


Aro 


Asp 


Thr 


Pro 


Val 


Val 


Lvs 






595 










600 


lie 


lie 


Leu 


Ala 


Glv 


He 


Phe 


Leu 




610 










615 




lie 


Ala 


Lvs 


Pro 


Thr 


Thr 


Thr 


Ser 


625 










630 






Glv 


Leu 


Ser 


Ser 


Ala 


Met 


Cvs 


Tvr 










645 








Ara 


lie 


Ala 


Ara 


He 


Leu 


Ala 


Glv 








660 










Lys 


Pro 


Arcr 


Phe 


Met 


Ser 


Ala 


Trr> 






675 










680 


Leu 


lie 


Ser 


Val 


Gin 


Leu 


Thr 


Leu 




690 










695 




Pro 


Pro 


Met 


Pro 


He 


Leu 


Ser 


Tyr 


705 










710 






Tie 




Asn 


Thr 


Ser 


Asn 


Leu 


Ui J 










no 5 








Glv 


Leu 


Leu 


He 


Met 


Ser 


Cvs 


Thr 


















Val 


Pro 


Ala 


Asn 


Phe 


Asn 


Glu 


Ala 






755 










760 


Thr 


Thr 


Cvs 
wy o 


He 


He 


Tm 


Leu 


Ala 




770 










775 




Asn 


Tvr 


Lys 


He 


He 


Thr 


Thr 


Cys 


785 










790 






Val 


Ala 


Leu 


Glv 


Cys 


Met 


Phe 


Thr 










805 








Lys 


Pro 


Glu 


Ara 


Asn 


Val 


Ara 


Ser 








820 












Met 


His 


Val 


Glv 

oxy 


Asp 


Glv 


Lys 






835 










840 


Leu 


Asn 


lie 


Phe 


Ara 


Arcr 


Lys 


Lys 




850 










855 




Han 


Glv 


Lys 


Ser 


Val 


Ser 


Tro 


Ser 


865 










870 






Lys 


Gly 


Gin 


His 


Val 




Gin 


Ara 










885 








ulU • 


Thr 


Al a 


^*y o 






Thr 


Ala 








Qfifi 










Tyr 


Gin 


Gly 


Ser 


Gly 


Lys 


Ser 


Leu 






915 










920 


Thr 


Leu 


Tyr 


Asn 


Val 


Glu 


Glu 


Glu 




930 










935 




Ser 


Pro 


Pro 


Ser 


Ser 


Pro 


Ser 


Met 


945 










950 






Val 


Ala 


Thr 


Thr 


Pro 


Pro 


Leu 


Pro 



965 



Ala 


Cvs 


Thr Phe Cys 


Gly Gin 


Asp 






. 540 






Ara 


Cys 


Phe Ara Ara 


Arg Ser 


Ara 






555 




560 


Ser 


He 


He Ala He 


Ala Phe 


Ser 




570 




575 




Phe 


Val 


Thr Leu He 


Phe Val 


Leu 


585 






590 




Ser 


Ser 


Ser Arg Glu 


Leu Cys 


Tvr 






605 






Glv 


Tvr 


Val Cys Pro 


Phe Thr 


Leu 






620 






Cvs 


Tvr 


Leu Gin Arg 


Leu Leu 


Val 






635 




640 


Ser 


Ala 


Leu Val Thr 


Lys Thr 


Asn 




650 




655 




Ser 


Lys 


Lvs Lvs Tie 


Cys Thr 


Ara 


665 






670 




Ala 


Gin 


Val He He 


Ala Ser 


He 






685 






Val 


Val 


Thr Leu He 


He Met 


Glu 






700 






Pro 


Ser 


He Lys Glu 


Val Tyr 


Leu 






715 




720 


Val 


Val 


Ala Pro Val 

nio nu vcijl 


Qlv Tvr 


Asn 




7"* 0 




7** R 




A / L 




Ala Phe Lvs 


Thr Arg 


Asn 












Lys 


xyi 


Tie Ala Phe 

x .1. c rti.a rue 


Thr Met 


Tvr 






ZDS 






Phe 


Val 


Pro lie Tvr 


Phe Gly 


Ser 






780 






Phe 


Ala 


Val Ser lieu 

VOX OC1 JJCU 


Ser Val 


Thr 






795 




800 


Pro 


Lys 


Met Tyr He 


He He 


Ala 




810 




815 




Ala 


Phe 


Thr Thr Ser 

A-iiJ. AAAJa UVi. 


Asp Val 


Val 


825 






830 




Leu 


Pro 


Cvs Ara Ser 


Asn Thr 

noil 


Phe 






845 






Pro 


Glv 


Ala Glv Asn 


Ala Asn 


Ser 






860 






Glu 


Pro 


Glv Glv Ara 


Gin Ala 


Pro 






875 




880 


Leu 


Ser 


Val Wis Val 

VAX mo Vul 


Lys Thr 


Asn 




890 




895 




Val 


Tie 


TiVS Pro Tien 


Thr TiVR 
J.JJ.J. jjy o 


Opr 


Qfi5 
j w _» 






91 0 




1X1X7 


riie 


OcJ. HSp nla 




Lys 






925 






Asp 


Asn 


Thr Pro Ser 


Ala His 


Phe 






940 






Val 


Val 


His Arg Arg 


Gly Pro 


Pro 






955 




960 


Pro 


His 


Leu Thr Ala 


Glu Glu 


Thr 




970 




975 
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Pro Leu Phe Leu Ala Asp Ser Val lie Pro Lys Gly Leu Pro Pro Pro 

980 985 990 

Leu Pro Gin Gin Gin Pro Gin Gin Pro Pro Pro Gin Gin Pro Pro Gin 

995 1000 1005 

Gin Pro Lys Ser Leu Met Asp Gin Leu Gin Gly Val Val Thr Asn Phe 

1010 1015 1020 

Gly Ser Gly lie Pro Asp Phe His Ala Val Leu Ala Gly Pro Gly Thr 
1025 1030 1035 1040 

Pro Gly Asn Ser Leu Arg Ser Leu Tyr Pro Pro Pro Pro Pro Pro Gin 

1045 1050 1055 

His Leu Gin Met Leu Pro Leu His Leu Ser Thr Phe Gin Glu Glu Ser 

1060 1065 1070 

lie Ser Pro Pro Gly Glu Asp lie Asp Asp Asp Ser Glu Arg Phe Lys 

1075 1080 1085 

Leu Leu Gin Glu Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp 

1090 1095 1100 

Glu Leu Glu Glu Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro 
1105 1110 1115 1120 

Glu Asp Ser Pro Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val 

1125 1130 1135 

Ala Ser Gly Ser Ser Val Pro Ser Ser Pro Val Ser Glu Ser Val Leu 

1140 1145 1150 

Cys Thr Pro Pro Asn Val Thr Tyr Ala Ser Val lie Leu Arg Asp Tyr 

1155 1160 1165 

Lys Gin Ser Ser Ser Thr Leu 
1170 1175 

<210> 31 . 
<211> 867 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 



<400> 31 



Met 


Val Arg Leu 


Leu Leu 


He 


Phe 


Phe 


Pro 


Met 


He 


Phe Leu 


Glu 


Met 


1 




5 








10 








15 




Ser 


lie Leu Pro 


Arg Met 


Pro 


Asp 


Arg 


Lys 


Val 


Leu 


Leu Ala 


Gly 


Ala 




20 








25 








30 






Ser 


Ser Gin Arg 


Ser Val 


Ala 


Arg 


Met 


Asp 


Gly 


Asp 


Val He 


lie 


Gly 




35 






40 










45 






Ala 


Leu Phe Ser 


Val His 


His 


Gin 


Pro 


Pro 


Ala 


Glu 


Lys Val 


Pro 


Glu 




50 




55 










60 








Arg 


Lys Cys Gly 


Glu He 


Arg 


Glu 


Gin 


Tyr 


Gly 


He 


Gin Arg 


Val 


Glu 


65 




70 










75 








80 


Ala- 


Met Phe His 


Thr Leu 


Asp 


Lys 


He 


Asn 


Ala 


Asp 


Pro Val 


Leu 


Leu 






85 








90 








95 




Pro 


Asn lie Thr 


Leu Gly 


Ser 


Glu 


He 


Arg 


Asp 


Ser 


Cys Trp 


His 


Ser 




100 








105 








110 






Ser 


Val Ala Leu 


Glu Gin 


Ser 


He 


Glu 


Phe 


He 


Arg 


Asp Ser 


Leu 


He 




115 






120 










125 






Ser 


lie Arg Asp 


Glu Lys 


Asp 


Gly 


Leu 


Asn 


Arg 


Cys 


Leu Pro 


Asp 


Gly 




130 




135 










140 








Gin 


Thr Leu Pro 


Pro Gly 


Arg 


Thr 


Lys 


Lys 


Pro 


He 


Ala Gly 


Val 


He 


145 




150 










155 








160 
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Gly Pro Gly Ser Ser Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

165 170 175 

Leu Phe Asp He Pro Gin He Ala Tyr Ser Ala Thr Ser . He Asp Leu 

180 185 190 

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 

195 200 205 

Thr Leu Gin Ala Arg Ala Met Leu Asp He Val Lys Arg Tyr Asn Trp 

210 215 220 

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 
225 230 235 240 

Met Asp Ala Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys He Ala 

245 250 255 

His Ser Asp Lys He Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 

260 265 270 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 

275 280 285 

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 

290 295 300 

Arg Leu Gly Val Val Gly Glu Phe Ser Leu He Gly Ser Asp Gly Trp 
305 310 315 " 320 

Ala Asp Arg Asp Glu Val He Glu Gly Tyr Glu Val Glu Ala Asn Gly 

325 330 335 

Gly He Thr He Lys Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 

340 345 350 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

355 360 365 

Pro Glu Phe Trp Gin His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

370 375 380 

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385 390 395 400 

Glu Glu Asn Tyr Val Gin Asp Ser Lys Met Gly Phe Val He Asn Ala 

405 410 415 

He Tyr Ala Met Ala His Gly Leu Gin Asn Met His His Ala Leu Cys 

420 425 430 

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro He Asp Gly Arg 

435 440 445 

Lys Leu Leu Asp Phe Leu He Lys Ser Ser Phe Val Gly Val Ser Gly 

450 455 460 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465 470 475 " 480 

He Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 490 495 

Val Gly Thr Trp His Glu Gly Val Leu Asn He Asp Asp Tyr Lys He 

500 505 510 

Gin. Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys 

515 520 525 

Leu Lys Gly Gin He Lys Val He Arg Lys Gly Glu Val Ser Cys Cys 

530 535 540 

Trp He Cys Thr Ala Cys Lys Glu Asn Glu Phe Val Gin Asp Glu Phe 
545 550 555 560 

Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr 

565 570 575 

Gly Cys Glu Pro He Pro Val Arg Tyr Leu Glu Leu Arg Glu His Thr 

580 585 590 

Ser Trp Val Leu Leu Ala Ala Asn Thr Leu Leu Leu Leu Leu Leu Leu 

595 600 605 

Gly Thr Ala Gly Leu Phe Ala Trp His Leu Asp Thr Pro Val Val Arg 



51 



WO 2005/015158 



PCT/US2004/025459 



610 615 620 

Ser Ala Gly Gly Arg Leu Cys Phe Leu Met Leu Gly Ser Leu Ala Ala 
625 630 635 640 

Gly Ser Gly Ser Leu Tyr Gly Phe Phe Gly Glu Pro Thr Arg Pro Ala 

645 650 655 

Cys Leu Leu Arg Gin Ala Leu Phe Ala Leu Gly Phe Thr He Phe Leu 

660 665 670 

Ser Cys Leu Thr Val Arg Ser Phe Gin Leu He He He Phe Lys Phe 

675 680 685 

Ser Thr Lys Val Pro Thr Phe Tyr His Ala Trp Val Gin Asn His Gly 

690 695 700 

Ala Gly Leu Phe Val Met He Ser Ser Ala Ala Gin Leu Leu He Cys 
705 710 715 720 

Leu Thr Trp Leu Val Val Trp Thr Pro Leu Pro Ala Arg Glu Tyr Gin 

725 730 ~* 735 

Arg Phe Pro His Leu Val Met Leu Glu Cys Thr Glu Thr Asn Ser Leu 

740 745 750 

Gly Phe He Leu Ala Phe Leu Tyr Asn Gly Leu Leu Ser He Ser Ala 

755 760 765 

Phe Ala Cys Ser Tyr Leu Gly Lys Asp Leu Pro Glu Asn Tyr Asn Glu 

770 775 780 

Ala Lys Cys Val Thr Phe Ser Leu Leu Phe Asn Phe Val Ser Trp He 
785 790 795 800 

Ala Phe Phe Thr Thr Ala Ser Val Tyr Asp Gly Lys Tyr Leu Pro Ala 

805 810 815 

Ala Asn Met Met Ala Gly Leu Ser Ser Leu Ser Ser Gly Phe Gly Gly 

820 825 830 

Tyr Phe Leu Pro Lys Cys Tyr Val He Leu Cys Arg Pro Asp Leu Asn 

835 840 845 

Ser Thr Glu HiB Phe Gin Ala Ser He Gin Asp Tyr Thr Arg Arg Cys 

850 855 860 

Gly Ser Thr 
865 

<210> 32 
<211> 866 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 32 
















Met Val 


Arg 


Leu 


Leu 


Leu 


He 


Phe Phe 


Pro Met 


He Phe Leu Glu Met 


1 






5 








10 


15 


Ser He 


Leu 


Pro 
20 


Arg 


Met 


Pro 


Asp Arg 
25 


Lys Val 


Leu Leu Ala Gly Ala 
30 


Ser Ser 


Gin 
35 


Arg 


Ser 


Val 


Ala 


Arg Met 
40 


Asp Gly 


Asp Val He lie Gly 
45 


Ala Leu 


Phe 


Ser 


Val 


His 


His 


Gin Pro 


Pro Ala 


Glu Lys Val Pro Glu 


50 










55 






60 

He Gin Arg Val Glu 


Arg Lys 


Cys 


Gly 


Glu 


He 


Arg 


Glu Gin 


Tyr Gly 


65 








70 






75 


80 


Ala Met 


Phe 


His 


Thr 
85 


Leu 


Asp 


Lys He 


Asn Ala 
90 


Asp Pro Val Leu Leu 
95 


Pro Asn 


He 


Thr 


Leu 


Gly 


Ser 


Glu He 


Arg Asp 


Ser Cys Trp His Ser 
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Ser Val 

Ser He 
130 
Gin Thr 
145 

Gly Pro 

Leu Phe 

Ser Asp 

Thr Leu 
210 
Thr Tyr 
225 

Met Asp 

His Ser 

Leu Leu 

Cys Phe 
290 
Arg Leu 
305 

Ala Asp 

Gly He 

Tyr Phe 

Pro Glu 
370 
Leu Glu 
385 

Glu Glu 

He Tyr 

Pro Gly 

Lys Leu 
450 
Glu Glu 
465 

He Met 

Val Gly 

Gin Met 

Leu Lys 
530 
Trp He 
545 



100 
Ala Leu Glu 
115 

Arg Asp Glu 

Leu Pro Pro 

Gly Ser Ser 
165 

Asp He Pro 
180 

Lys Thr Leu 
195 

Gin Ala Arg 

Val Ser Ala 

Ma Phe Lys 
245 

Asp Lys He 

260 
Arg Lys Leu 
275 

Cys Glu Gly 

Gly Val Val 

Arg Asp Glu 
325 

Thr He Lys 

340 
Leu Lys Leu 
355 

Phe Trp Gin 

Asn Pro Asn 

Asn Tyr Val 
405 

Ala Met Ala 

420 
His Val Gly 
435 

Leu Asp Phe 

Val Trp Phe 

Asn Leu Gin 
485 

Thr Trp His 

500 
Asn Lys Ser 
515 

Gly Gin He 
Cys Thr Ala 



105 110 
Gin Ser He Glu Phe He Arg Asp Ser Leu He 

120 125. 
Lys Asp Gly Leu Asn Arg Cys Leu Pro Asp Gly 

135 140 
Gly Arg Thr Lys Lys Pro He Ala Gly Val He 
150 155 160 

Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

170 175 
Gin He Ala Tyr Ser Ala Thr Ser He Asp Leu 

185 190 
Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 

200 205 
Ala Met Leu Asp He Val Lys Arg Tyr Asn Trp 

215 220 
Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 
230 235 240 

Glu Leu Ala Ala Gin Glu Gly Leu Cys He Ala 

250 255 
Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 

265 270 
Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 

280 285 
Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 

295 300 
Gly Glu Phe Ser Leu He Gly Ser Asp Gly Trp 
310 315 320 

Val He Glu Gly Tyr Glu Val Glu Ala Asn Gly 

330 335 
Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 

345 350 
Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

360 365 
His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

375 380 
Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
390 395 400 

Gin Asp Ser Lys Met Gly Phe Val He Asn Ala 

410 415 
His Gly Leu Gin Asn Met His His Ala Leu Cys 

425 430 
Leu Cys Asp Ala Met Lys Pro He Asp Gly Arg 

440 445 
Leu He Lys Ser Ser Phe Val Gly Val Ser Gly 

455 460 
Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
470 475 480 

Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

490 495 
Glu Gly Val Leu Asn lie Asp Asp Tyr Lys He 

505 510 
Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys 

520 525 
Lys Val He Arg Lys Gly Glu Val Ser Cys Cys 

535 540 
Cys Lys Glu Asn Glu Phe Val Gin Asp Glu Phe 
550 555 560 
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Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr 

565 570 575 

Gly Cys Glu Pro He Pro Val Arg Tyr Leu Glu Trp His Glu Ala Pro 

580 585 590 

Thr He Ala Val Ala Leu Leu Ala Ala Leu Gly Phe Leu Ser Thr Leu 

595 600 605 

Ala He Leu Val He Phe Trp Arg His Phe Gin Thr Pro He Val Arg 

610 615 620 

Ser Ala Gly Gly Pro Met Cys Phe Leu Met Leu Thr Leu Leu Leu Val 
625 630 635 640 

Ala Tyr Met Val Val Pro Val Tyr Val Gly Pro Pro Lys Val Ser Thr 

645 650 . 655 

Cys Leu Cys Arg Gin Ala Leu Phe Pro Leu Cys Phe Thr He Cys He 

660 665 670 

Ser Cys He Ala Val Arg Ser Phe Gin He Val Cys Ala Phe Lys Met 

675 680 685 

Ala Ser Arg Phe Pro Arg Ala Tyr Ser Tyr Trp Val Arg Tyr Gin Gly 

690 695 700 

Pro Tyr Val Ser Met Ala Phe He Thr Val Leu Lys Met Val lie Val 
705 710 715 720 

Val He Gly Met Leu Ala Thr Gly Leu Ser Pro Thr Thr Arg Thr Asp 

725 730 735 

Pro Asp Asp Pro Lys lie Thr He Val Ser Cys Asn Pro Asn Tyr Arg 

740 745 750 

Asn Ser Leu Leu Phe Asn Thr Ser Leu Asp Leu Leu Leu Ser Val Val 

755 760 765 

Gly Phe. Ser Phe Ala Tyr Met Gly Lys Glu Leu Pro Thr Asn Tyr Asn 

770 775 780 

Glu Ala Lys Phe lie Thr Leu Ser Met Thr Phe Tyr Phe Thr Ser Ser 
785 790 795 800 

Val Ser Leu Cys Thr Phe Met Ser Ala Tyr Ser Gly Val Leu Val Thr 

805 810 815 

He Val Asp Leu Leu Val Thr Val Leu Asn Leu Leu Ala He Ser Leu 

820 825 830 

Gly Tyr Phe Gly Pro Lys Cys Tyr Met He Leu Phe Tyr Pro Glu Arg 

835 * 840 845 

Asn Thr Pro Ala Tyr Phe Asn Ser Met He Gin Gly Tyr Thr Met Arg 

850 855 860 

Arg Asp 
865 

<210> 33 
<211> 876 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 

<400> 33 

Met Val Arg Leu Leu Leu He Phe Phe Pro Met He Phe Leu Glu Met 

15 10 15 

Ser He Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala 

20 25 30 

Ser Ser Gin Arg Ser Val Ala Arg Met Asp Gly Asp Val He He Gly 
35 40 45 
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Ala Leu Phe Ser Val His His Gin Pro Pro Ala Glu Lys Val Pro Glu 

50 55 60 

Arg Lys Cys Gly Glu He Arg Glu Gin Tyr Gly He Gin Arg Val Glu 
65 70 75 80 

Ala Met Phe His Thr Leu Asp Lys He Asn Ala Asp Pro Val Leu Leu 

85 90 95 

Pro Asn He Thr Leu Gly Ser Glu He Arg Asp Ser Cys Trp His Ser 

100 105 110 

Ser Val Ala Leu Glu Gin Ser He Glu Phe He Arg Asp Ser Leu He 

115 120 125 

Ser lie Arg Asp Glu Lys Asp Gly Leu Asn Arg Cys Leu Pro Asp Gly 

130 135 140 

Gin Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro He Ala Gly Val He 
145 150 155 160 

Gly Pro Gly Ser Ser Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

165 170 175 

Leu Phe Asp He Pro Gin He Ala Tyr Ser Ala Thr Ser He Asp Leu 

180 185 190 

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 

195 200 205 

Thr Leu Gin Ala Arg Ala Met Leu Asp lie Val Lys Arg Tyr Asn Trp 

210 215 220 

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 
225 230 235 240 

Met Asp Ala Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys He Ala 

245 250 * 255 

His Ser Asp Lys He Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 

260 265 270 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 

275 280 285 

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 

290 295 300 

Arg Leu Gly Val Val Gly Glu Phe Ser Leu lie Gly Ser Asp Gly Trp 
305 310 315 320 

Ala Asp Arg Asp Glu Val He Glu Gly Tyr Glu Val Glu Ala Asn Gly 

325 330 335 

Gly lie Thr lie Lys Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 

340 345 350 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

355 360 365 

Pro Glu Phe Trp Gin His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

370 375 380 

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385 390 395 400 

Glu Glu Asn Tyr Val Gin Asp Ser Lys Met Gly Phe Val He Asn Ala 

405 410 415 

lie Tyr Ala Met Ala His Gly Leu Gin Asn Met His His Ala Leu Cys 

420 425 430 

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro lie Asp Gly Arg 

435 440 445 

Lys Leu Leu Asp Phe Leu He Lys Ser Ser Phe Val Gly Val Ser Gly 

450 455 460 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465 470 475 480 

lie Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 490 ~ 495 

Val Gly Thr Trp His Glu Gly Val Leu Asn He Asp Asp Tyr Lys He 
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500 



.Gin Met 


Asn Lys Ser Gly 


Met 


Val 




515 




520 


Leu Lys 


Gly Gin He Lys 


Val 


He 


530 




535 




Trp He 


Cys Thr Ala Cys 


Lys 


Glu 


545 


550 






Thr Cys 


Arg Ala Cys Asp 


Leu 


Gly 




565 






Gly Cvs 


Glu Pro He Pro 


Val 


Arg 




580 






Val Leu 


Leu Leu Leu Leu 


Leu 


Leu 




595 




600 


Ala Ala 


Tieu fll v Leu Phe 

ucu vjxjt lieu nic 


Val 


His 


DID 




615 




Ala Ser 


Glv Glv Pro Leu 


Ala 


Cys 




630 






Val Cys 


Leu Ser Val Leu 


Leu 


Phe 




645 






Cvs Leu 


Ala Gin Gin Pro 


Leu 


Ser 




660 






Ser Thr 


Leu Phe Leu Gin 


Ala 


Ala 




675 




680 


Pro Leu 


Ser Trp Ala Asp 


Arcr 


Leu 


690 




695 




Ala Trp 


Leu Val Val Leu 


Leu 


Ala 


705 


710 






Thr Trp 


Tyr Leu Val Ala 


Phe 


Pro 




725 






Met Leu 


Pro Thr Glu Ala 


Leu 


Val 




740 






Ser Phe 


Gly Leu Ala His 


Ala 


Thr 




755 




760 


Phe Leu 


Gly Thr Phe Leu 


Val 


Arcr 


■ 770 




775 




Ala Arcr 

« J- c* ni, y 


Glv Leu Thr Phe 


Ala 


Met 


/ Oj 


790 






Ser Phe 


Val Pro Leu Leu 


Ala 


Asn 










Val Gin 


Met Gly Ala Leu 


Leu 


Leu 




820 






Phe His 


Leu Pro Arg Cys 


Tyr 


Leu 




835 




840 


Thr Pro 


Glu Phe Phe Leu 


Gly 


Gly 


850 




855 




Asn Asp 


Gly Asn Thr Gly 


Asn 


Gin 


865 


870 







505 










510 






Arcr 


Ser 


Val 


Cvs 


Ser 


Glu 


Pro 


Cvs 










525 








Arcr 


Lvs 


Gly 


Glu 


Val 


Ser 


Cys 


Cvs 








540 










Asn 


Glu 


Phe 


Val 


Gin 


Asp 


Glu 


Phe 






555 










560 


Tro 


Tro 


Pro 


Asn 


Ala 


Glu 


Leu 


Thr 




570 










575 




Tvr 


Leu 


Glu 


Trp 


Gly 


Glu 


Pro 


Ala 


585 










590 






Ser 


Leu 


Ala 


Leu 


Glv 


Leu 


Val 


Leu 










605 








His 


Aver 


Ann 


Ser 


Pro 


Leu 


Val 


Gin 








620 










XT HC 


Gl v 


JJCU 


Val 


v. E> 


Leu 


Glv 
oxy 


Leu 






635 










640 




m v 

v?xy 


V9XXX 




Cay 
OCX 


AT X. U 


Ala 


Axg 




650 










655 




His 

X1JL a 


JJCU 


XrX V 


ucu 


Thr* 


Glv 


Cys 


Leu 


665 










670 






Glu 


He 


Phe 


Val 


Glu 


Ser 


Glu 


Leu 










685 








Ser 


Glv 

uxy 


Cys 


Leu 


Arcr 


Glv 

ox/ 


Pro 


Tro 








700 










Met 


Leu 


Val 


Glu 


Val 


Ala 


Leu 


Cys 






715 










720 


Pro 


Glu 


Val 


Val 


Thr 


Asp 


Axp 


His 




730 










735 




His 


Cys 


Arcr 


Thr 


Arcr 


Ser 


Tro 


Val 


745 










750 






Asn 


Ala 


Thr 


Leu 


Ala 


Phe 


Leu 
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NUCLEIC ACIDS ENCODING A G-PROTEIN COUPLED 
RECEPTOR INVOLVED IN SENSORY TRANSDUCTION 

CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority to USSN 60/094,465, filed July 28, 1998, 
herein incorporated by reference in its entirety. 

STATEMENT AS TO FEDERALLY SPONSORED RESEARCH AND 

DEVELOPMENT . 

This invention was made with government support under Grant No. 5R01 

DC03160, awarded by the National Institutes of Health. The government has certain 

rights in this invention. 

FIELD OF THE INVENTION 
The invention provides isolated nucleic acid and amino acid sequences of . 
sensory cell specific G-protein coupled receptors, antibodies to such receptors, methods 
of detecting such nucleic acids and receptors, and methods of screening for modulators of 
sensory cell specific G-protein coupled receptors. 

BACKGROUND OF THE INVENTION 
Taste transduction is one of the most sophisticated forms of 
chemotransduction in animals (see, e.g., Margolskee, BioEssays 15:645-650 (1993); 
Avenet & Lindemann, J. Membrane Biol. 112:1-8 (1989)). Gustatory signaling is found 
throughout the animal kingdom, from simple metazoans to the most complex of 
vertebrates; its main purpose is to provide a reliable signaling response to non-volatile 
ligands. Each of these modalities is though to be mediated by distinct signaling pathways 
mediated by receptors or channels, leading to receptor cell depolarization, generation of a 
receptor or action potential, and release of neurotransmitter at gustatory afferent neuron 
synapses {see, e.g., Ropex 9 Ann. Rev. Neurosci. 12:329-353 (1989)). 
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Mammals are believed to have five basic taste modalities: sweet, bitter, 
sour, salty and unami (the taste of monosodium glutamate) {see, e.g., Kawamura & Kare, 
Introduction to Unami: A Basic Taste (1987); Kinnamon & Cummings, Ann. Rev. 
Physiol. 54:715-731(1992); Lindemann, Physiol. Rev. 76:718-766 (1996); Stewart et aL, 
5 Am. J. Physiol. 272: 1 -26 ( 1 997)). Extensive psychophysical studies in humans have 
reported that different regions of the tongue display different gustatory preferences (see, 
e.g., Hoffmann, Mencken. Arch. Path. Anat. Physiol. 62:516-530 (1875); Bradley et al, 
Anatomical Record 212: 246-249 (1985); Miller & Reedy, Physiol. Behav. 47:1213-1219 
(1990)). Also, numerous physiological studies in animals have shown that taste receptor 
1 0 cells may selectively respond to different tastants (see, e.g. , Akabas et al. , Science 

242:1047-1050 (1988); Gilbertson et al., J. Gen. Physiol. 100:803-24 (1992); Bernhardt 
et al, J. Physiol 490:325-336 (1996); Cummings et al, J. Neurophysiol. 75:1256-1263 
(1996)). 

In mammals, taste receptor cells are assembled into taste buds that are 
1 5 distributed into different papillae in the tongue epithelium. Circumvallate papillae, found 
at the very back of the tongue, contain hundreds (mice) to thousands (human) of taste 
buds and are particularly sensitive to bitter substances. Foliate papillae, localized to the 
posterior lateral edge of the tongue, contain dozens to hundreds of taste buds and are 
particularly sensitive to sour and bitter substances. Fungiform papillae containing a 
20 single or a few taste buds are at the front of the tongue and are thought to mediate much 
of the sweet taste modality. 

Each taste bud, depending on the species, contain 50-150 cells, including 
precursor cells, support cells, and taste receptor cells (see, e.g., Lindemann, Physiol Rev. 
76:71 8-766 (1 996)). Receptor cells are innervated at their base by afferent nerve endings 
25 that transmit information to the taste centers of the cortex through synapses in the brain 
stem and thalamus. Elucidating the mechanisms of taste cell signaling and information 
processing is critical for understanding the function, regulation, and "perception" of the 
sense of taste. 

Although much is known about the psychophysics and physiology of taste 
30 cell function, very little is known about the molecules and pathways that mediate these 
sensory signaling responses (reviewed by Gilbertson, Current Opn. in Neurobiol 3:532- 
539 (1993)). Electrophysiological studies suggest that sour and salty tastants modulate 
taste cell function by direct entry of H* and Na + ions through specialized membrane 
channels on the apical surface of the cell. In the case of sour compounds, taste cell 
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depolarization is hypothesized to result from H + blockage of K + channels {see, e.g., 
Kinnamon et aL, Proc. Nat'lAcad. Sci. USA 85: 7023-7027 (1988)) or activation of pH- 
sensitive channels (see, e.g., Gilbertson et aL, J. Gen. Physiol. 100:803-24 (1992)); salt 
transduction may be partly mediated by the entry of Na + via amiloride-sensitive Na + 
5 channels (see, e.g., Heck et aL, Science 223:403-405 (1984); Brand et aL, Brain Res. 207- 
214 (1985); Avenet etaL, Nature 331: 351-354 (1988)). 

Sweet, bitter, and unami transduction are believed to be mediated by G- 
protein-coupled receptor (GPCR) signaling pathways (see, e.g., Striem et al.,Biochem. J. 
260:121-126 (1989); Chaudhari et aL, J. Neuros. 16:3817-3826 (1996); Wong et aL, 

10 Nature 381 : 796-800 (1996)). Confusingly, there are almost as many models of signaling 
pathways for sweet and bitter transduction as there are effector enzymes for GPCR 
cascades (e.g., G protein subunits, cGMP phosphodiesterase, phospholipase C, adenylate 
cyclase; see, e.g., Kinnamon & Margolskee, Curr. Opin. NeurobioL 6:506-513 (1996)). 
However, little is known about the specific membrane receptors involved in taste 

1 5 transduction, or many of the individual intracellular signaling molecules activated by the * 
individual taste transduction pathways. Identification of such molecules is important 
given the numerous pharmacological and food industry applications for bitter antagonists, 
sweet agonists, and modulators of salty and sour taste. 

The identification and isolation of taste receptors (including taste ion 

20 channels), and taste signaling molecules, such as G-protein subunits and enzymes 
involved in signal transduction, would allow for the pharmacological and genetic 
modulation of taste transduction pathways. For example, availability of receptor and 
channel molecules would permit the screening for high affinity agonists, antagonists, 
inverse agonists, and modulators of taste cell activity. Such taste modulating compounds 

25 could then be used in the pharmaceutical and food industries to customize taste. In 

addition, such taste cell specific molecules can serve as invaluable tools in the generation 
of taste topographic maps that elucidate the relationship between the taste cells of the 
tongue and taste sensory neurons leading to taste centers in the brain. 

30 SUMMARY OF THE INVENTION 

The present invention thus provides for the first time nucleic acids 
encoding a taste cell specific G-protein coupled receptor. These nucleic acids and the 
polypeptides that they encode are referred to as "GPCR-B3" for G-protein coupled 
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receptor ("GPCR") B3. These taste cell specific GPCRs are components of the taste 
transduction pathway. 

In one aspect, the present invention provides an isolated nucleic acid 
encoding a sensory transduction G-protein coupled receptor, the receptor comprising 
5 greater than about 70% amino acid identity to an amino acid sequence of SEQ ID NO:l, 
SEQ ID NO:2, or SEQ ID NO:3. 

In one embodiment, the nucleic acid comprises a nucleotide sequence of 
SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In another embodiment, the nucleic acid 
is amplified by primers that selectively hybridize under stringent hybridization conditions 
10 to the same sequence as degenerate primer sets encoding amino acid sequences selected 
from the group consisting of: IAWDWNGPKW (SEQ ID NO:7) and LPENYNEAKC 
(SEQ ID NO:8). 

In another aspect, the present invention provides an isolated nucleic acid 
encoding a sensory transduction G-protein coupled receptor, wherein the nucleic acid 
15 specifically hybridizes under highly stringent conditions to a nucleic acid having the 
sequence of SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. 

In another aspect, the present invention provides an isolated nucleic acid 
encoding a sensory transduction G-protein coupled receptor, the receptor comprising 
greater than about 70% amino acid identity to a polypeptide "having a sequence of SEQ ID 
20 NO: 1 , SEQ ID NO:2, or SEQ ID NO:3 wherein the nucleic acid selectively hybridizes 
under moderately stringent hybridization conditions to a nucleotide sequence of SEQ ID 
NO:4, SEQ ED NO:5, or SEQ ID NO:6. 

In another aspect, the present invention provides an isolated nucleic acid 
encoding an extracellular domain of a sensory transduction G-protein coupled receptor, 
25 the extracellular domain having greater than about 70% amino acid sequence identity to 
the extracellular domain of SEQ ID NO:L 

. In another aspect, the present invention provides an isolated nucleic acid 
encoding a transmembrane domain of a sensory transduction G-protein coupled receptor, 
the transmembrane domain comprising greater than about 70% amino acid sequence 
30 identity to the transmembrane domain of SEQ ID NO: 1 . 

In another aspect, the present invention provides an isolated sensory 
transduction G-protein coupled receptor, the receptor comprising greater than about 70% 
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amino acid sequence identity to an amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, 
orSEQIDNO:3. 

In one embodiment, the receptor specifically binds to polyclonal 
antibodies generated against SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. In another 
5 embodiment, the receptor has G-protein coupled receptor activity. In another 

embodiment, the receptor has an amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, or 
SEQ ED NO:3. In another embodiment, the receptor is from a human, a rat, or a mouse. 

In one aspect, the present invention provides an isolated polypeptide 
comprising an extracellular domain of a sensory transduction G-protein coupled receptor, 
10 the extracellular domain comprising greater than about 70% amino acid sequence identity 
to the extracellular domain of SEQ ID NO: 1 . 

In one embodiment, the polypeptide encodes the extracellular domain of 
SEQ ID NO: 1 . In another embodiment, the extracellular domain is covalently linked to a 
heterologous polypeptide, forming a chimeric polypeptide. 
1 5 In one aspect, the present invention provides an isolated polypeptide 

comprising a transmembrane domain of a sensory transduction G-protein coupled 
receptor, the transmembrane domain comprising greater than about 70% amino acid 
sequence identity to the transmembrane domain of SEQ ID NO: 1 . 

In one embodiment, the polypeptide encodes the transmembrane domain 
20 of SEQ ID NO: 1 . In another embodiment, the polypeptide further comprises a 
cytoplasmic domain comprising greater than about 70% amino acid identity to the 
cytoplasmic domain of SEQ ID NO: 1 . In another embodiment, the polypeptide encodes 
the cytoplasmic domain of SEQ ID NO: 1 . In another embodiment, the transmembrane 
domain is covalently linked to a heterologous polypeptide, forming a chimeric 
25 polypeptide. In another embodiment, the chimeric polypeptide has G-protein coupled 
receptor activity. 

In one aspect, the present invention provides an antibody that selectively 
binds to the receptor comprising greater than about 70% amino acid sequence identity to 
an amino acid sequence of SEQ ID NO: 1, SEQ ID NO:2, or SEQ ID NO:3. 
30 In another aspect, the present invention provides an expression vector 

comprising a nucleic acid encoding a polypeptide comprising greater than about 70% 
amino acid sequence identity to an amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, 
orSEQIDNO:3. 
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In another aspect, the present invention provides a host cell transfected 
with the expression vector. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates sensory signaling in sensory cells, the method comprising the 
5 steps of: (i) contacting the compound with a polypeptide comprising an extracellular 
domain of a sensory transduction G-protein coupled receptor, the extracellular domain 
comprising greater than about 70% amino acid sequence identity to the extracellular 
domain of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3; and (ii) determining the 
functional effect of the compound upon the extracellular domain. 

1 0 In another aspect, the present invention provides a method for identifying a 

compound that modulates sensory signaling in sensory cells, the method comprising the 
steps of: (i) contacting the compound with a polypeptide comprising an extracellular 
domain of a sensory transduction G-protein coupled receptor, the transmembrane domain 
comprising greater than about 70% amino acid sequence identity to the extracellular 

15 domain of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3; and (ii) determining the 
functional effect of the compound upon the transmembrane domain. 

In one embodiment, the polypeptide is a sensory transduction G-protein 
coupled receptor, the receptor comprising greater than about 70% amino acid identity to a 
polypeptide encoding SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. In another 

20 embodiment, polypeptide comprises an extracellular domain that is covalently linked to a 
heterologous polypeptide, forming a chimeric polypeptide. In another embodiment, the 
polypeptide has G-protein coupled receptor activity. In another embodiment, the 
extracellular domain is linked to a solid phase, either covalently or non-covalently. In 
another embodiment, the functional effect is determined by measuring changes in 

25 intracellular cAMP, IP3, or Ca 2+ . In another embodiment, the functional effect is a 
chemical effect. In another embodiment, the functional effect is a chemical effect. In 
another embodiment, the functional effect is determined by measuring binding of the 
compound to the extracellular domain. In another embodiment, the polypeptide is 
recombinant. In another embodiment, the polypeptide is expressed in a cell or cell 

30 membrane. In another embodiment, the cell is a eukaryotic cell. 

In one embodiment, the polypeptide comprises an transmembrane domain 
that is covalently linked to a heterologous polypeptide, forming a chimeric polypeptide. 

In one aspect, the present invention provides a method of making a 
sensory transduction G-protein coupled receptor, the method comprising the step of 
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expressing the receptor from a recombinant expression vector comprising a nucleic acid 
encoding the receptor, wherein the amino acid sequence of the receptor comprises greater 
than about 70% amino acid identity to a polypeptide having a sequence of SEQ ID NO:l, 
SEQ ID NO:2, or SEQ ID NO:3. 
5 In one aspect, the present invention provides a method of making a 

recombinant cell comprising a sensory transduction G-protein coupled receptor, the 
method comprising the step of transducing the cell with an expression vector comprising 
a nucleic acid encoding the receptor, wherein the amino acid sequence of the receptor 
comprises greater than about 70% amino acid identity to a polypeptide having a sequence 

10 of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. 

In one aspect, the present invention provides a method of making an 
recombinant expression vector comprising a nucleic acid encoding a sensory transduction 
G-protein coupled receptor, the method comprising the step of ligating to an expression 
vector a nucleic acid encoding the receptor, wherein the amino acid sequence of the 

1 5 receptor comprises greater than about 70% amino acid identity to a polypeptide having a 
sequence of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. 

BRIEF DESCRIPTION OF THE DRAWINGS 
20 Figure 1 shows the proposed topology of GPCR-B3, with a large 

extracellular domain extending from amino acid 1 to about amino acid 580 of the rat 
GPCR-B3 amino acid sequence (corresponding to nucleotide residues 1-1740 of the rat 
sequence, with the ATG initiator methionine defined as residue 1), and seven 
transmembrane domains. The large extracellular domain may extend into the first 
25 transmembrane domain. Dark residues indicate identities between GPCR-B3 and GPCR- 
B4 (for a description of GPCR-B4, see, e.g., USSN 60/095,464, filed July 28, 1998, and 
USSN 60/1 12,747, filed December 17, 1998; see also Hoon et al, Cell 96:541-551 
(1999)). 

Figure 2 is a western blot showing GPCR-B3 protein expression in taste 
30 buds but not in non-taste tissue. Using PCR assays, the following non-tongue tissues 
were screened for GPCR-B3 expression-brain, liver, olfactory epithelium, VNO, and 
heart. GPCR-B3 was expressed only in taste tissue (data not shown). 
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Figure 3 shows in situ hybridization of tongue tissue sections showing 
labeling of GPCR-B3 in taste receptor cells of taste buds, but not in adjacent non-taste 
tissue. 

Figure 4 shows a chimeric receptor containing the entire extracellular 
5 domain of the murine mGluRl receptor and the transmembrane domain comprising seven 
transmembrane regions and corresponding cytosolic loops, and C-terminal end from 
murine GPCR-B3. 

Figure 5 shows HEK cells transfected with the chimeric glutamate/GPCR- 
B3 receptor described in Figure 4. Figure 5 shows calcium response to glutamate, 
10 demonstrating robust coupling of the chimeric receptor to phospholipase C. These results 
indicate that the chimeric glutamate/GPCR-B3 can couple to the promiscuous G protein 
Gal 5 and trigger calcium responses that are detectable using the indicator Fura-2. 

DETAILED DESCRIPTION OF THE INVENTION 

15 I. Introduction 

The present invention provides for the first time nucleic acids encoding a 
taste cell specific G-protein coupled receptor. These nucleic acids and the receptors that 
they encode are referred to as "GPCR" for G-protein coupled receptor, and are designated 
as GPCR-B3. These taste cell specific GPCR are components of the taste transduction 

20 pathway. These nucleic acids provide valuable probes for the identification of taste cells, 
as the nucleic acids are specifically expressed in taste cells. For example, probes for 
GPCR polypeptides and proteins can be used to identity subsets of taste cells such as 
foliate cells and circumvallate cells, or specific taste receptor cells, e.g., sweet, sour, salty, 
and bitter. They also serve as tools for the generation of taste topographic maps that 

25 elucidate the relationship between the taste cells of the tongue and taste sensory neurons 
leading to taste centers in the brain. Furthermore, the nucleic acids and the proteins they 
encode can be used as probes to dissect taste-induced behaviors. 

The invention also provides methods of screening for modulators, e.g., 
activators, inhibitors, stimulators, enhancers, agonists, and antagonists, of these novel 

30 taste cell GPCRs. Such modulators of taste transduction are useful for pharmacological 
and genetic modulation of taste signaling pathways. These methods of screening can be 
used to identify high affinity agonists and antagonists of taste cell activity. These 
modulatory compounds can then be used in the food and pharmaceutical industries to 
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customize taste. Thus, the invention provides assays for taste modulation, where GPCR- 
B3 acts as an direct or indirect reporter molecule for the effect of modulators on taste 
transduction. GPCRs can be used in assays, e.g., to measure changes in ion 
concentration, membrane potential, current flow, ion flux, transcription, signal 
5 transduction, receptor-ligand interactions, second messenger concentrations, in vitro, in 
vivo, and ex vivo. In one embodiment, GPCR-B3 can be used as an indirect reporter via 
attachment to a second reporter molecule such as green fluorescent protein (see, e.g., 
Mistili & Spector, Nature Biotechnology 15:961-964 (1997)). In another embodiment, 
GPCR-B3 is recombinantly expressed in cells, and modulation of taste transduction via 

10 GPCR activity is assayed by measuring changes in Ca 2+ levels. 

Methods of assaying for modulators of taste transduction include in vitro 
ligand binding assays using GPCR-B3, portions thereof such as the extracellular domain, 
or chimeric proteins comprising one or more domains of GPCR-B3, oocyte GPCR-B3 
expression; tissue culture cell GPCR-B3 expression; transcriptional activation of GPCR- 

15 B3; phosphorylation and dephosphorylation of GPCRs; G-protein binding to GPCRs; 
ligand binding assays; voltage, membrane potential and conductance changes; ion flux 
assays; changes in intracellular second messengers such as cAMP and inositol 
triphosphate; changes in intracellular calcium levels; and neurotransmitter release. 

Finally, the invention provides for methods of detecting GPCR-B3 nucleic 

20 acid and protein expression, allowing investigation of taste transduction regulation and 
specific identification of taste receptor cells. GPCR-B3 also provides useful nucleic acid 
probes for paternity and forensic investigations. GPCR-B3 is a useful nucleic acid probe 
for identifying subpopulations of taste receptor cells such as foliate, fungiform, and 
circumvallate taste receptor cells. GPCR-B3 receptors can also be used to generate 

25 monoclonal and polyclonal antibodies useful for identifying taste receptor cells. Taste 
receptor cells can be identified using techniques such as reverse transcription and 
amplification of mRNA, isolation of total RNA or poly A* RNA, northern blotting, dot 
blotting, ip situ hybridization, RNase protection, SI digestion, probing DNA microchip 
arrays, western blots, and the like. 

30 Functionally, GPCR-B3 represents a seven transmembrane G-protein 

coupled receptor involved in taste transduction, which interacts with a G-protein to 
mediate taste signal transduction (see, e.g., Fong, Cell Signal 8:217 (1996); Baldwin, 
Curr. Opin. Cell Biol. 6:180 (1994)). 
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Structurally, the nucleotide sequence of GPCR-B3 (see, e.g., SEQ ID 
NOS:4-6, isolated from rat, mouse, and human respectively) encodes a polypeptide of 
approximately 840 amino acids with a predicted molecular weight of approximately 97 
kDa and a predicted range of 92-102 kDa (see, e.g., SEQ ID NOS:l-3). Related GPCR- 
B3 genes from other species share at least about 70% amino acid identity over a amino 
acid region at least about 25 amino acids in length, optionally 50 to 100 amino acids in 
length. GPCR-B3 is specifically expressed in foliate and fungiform cells, with lower 
expression in circum vallate taste receptor cells of the tongue. GPCR-B3 is an moderately 
rare sequence found in approximately 1/1 50,000 cDNAs from an oligo-dT primed 
circumvallate cDNA library (see Example 1). 

The present invention also provides polymorphic variants of the GPCR-B3 
depicted in SEQ ID NO: 1 : variant #1, in which an isoleucine residue is substituted for a 
leucine acid residue at amino acid position 33; variant #2, in which an aspartic acid 
residue is substituted for a glutamic acid residue at amino acid position 84; and variant 
#3, in which a glycine residue is substituted for an alanine residue at amino acid position 
90. 

Specific regions of the GPCR-B3 nucleotide and amino acid sequence may 
be used to identify polymorphic variants, interspecies homologs, and alleles of GPCR-B3. 
This identification can be made in vitro, e.g., under stringent hybridization conditions or 
PCR (using primers encoding SEQ ID NOS:7-8) and sequencing, or by using the 
sequence information in a computer system for comparison with other nucleotide 
sequences. Typically, identification of polymorphic variants and alleles of GPCR-B3 is 
made by comparing an amino acid sequence of about 25 amino acids or more, e.g., 50- 
100 amino acids. Amino acid identity of approximately at least 70% or above, optionally 
80%, optionally 90-95% or above typically demonstrates that a protein is a polymorphic 
variant, interspecies homolog, or allele of GPCR-B3. Sequence comparison can be 
performed using any of the sequence comparison algorithms discussed below. Antibodies 
that bind specifically to GPCR-B3 or a conserved region thereof can also be used to 
identify alleles, interspecies homologs, and polymorphic variants. 

Polymorphic variants, interspecies homologs, and alleles of GPCR-B3 are 
confirmed by examining taste cell specific expression of the putative GPCR-B3 
polypeptide. Typically, GPCR-B3 having the amino acid sequence of SEQ ID NO:l-3 is 
used as a positive control in comparison to the putative GPCR-B3 protein to demonstrate 
the identification of a polymorphic variant or allele of GPCR-B3. The polymorphic 
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variants, alleles and interspecies homologs are expected to retain the seven 
transmembrane structure of a G-protein coupled receptor. 

GPCR-B3 nucleotide and amino acid sequence information may also be 
used to construct models of taste cell specific polypeptides in a computer system. These 
models are subsequently used to identify compounds that can activate or inhibit GPCR- 
B3. Such compounds that modulate the activity of GPCR B4 can be used to investigate 
the role of GPCR-B3 in taste transduction. 

The isolation of GPCR-B3 for the first time provides a means for assaying 
for inhibitors and activators of G-protein coupled receptor taste transduction . 
Biologically active GPCR-B3 is useful for testing inhibitors and activators of GPCR-B3 
as taste transducers using in vivo and in vitro expression that measure, e.g., transcriptional 
activation of GPCR-B3; ligand binding; phosphorylation and dephosphorylation; binding 
to G-proteins; G-protein activation; regulatory molecule binding; voltage, membrane 
potential and conductance changes; ion flux; intracellular second messengers such as 
cAMP and inositol triphosphate; intracellular calcium levels; and neurotransmitter 
release. Such activators and inhibitors identified using GPCR-B3, can be used to further 
study taste transduction and to identify specific taste agonists and antagonists. Such 
activators and inhibitors are useful as pharmaceutical and food agents for customizing 
taste. 

Methods of detecting GPCR-B3 nucleic acids and expression of GPCR-B3 
are also useful for identifying taste cells and creating topological maps of the tongue and 
the relation of tongue taste receptor cells to taste sensory neurons in the brain. 
Chromosome localization of the genes encoding human GPCR-B3 can be used to identify 
diseases, mutations, and traits caused by and associated with GPCR-B3. 

II. Definitions 

As used herein, the following terms have the meanings ascribed to them 
unless specified otherwise. N 

"Taste receptor cells" are neuroepithelial cells that are organized into 
groups to form taste buds of the tongue, e.g., foliate, fungiform, and circumvallate cells 
(see, e.g., Roper etai, Ann. Rev. Neurosci. 12:329-353 (1989)). 

"GPCR-B3," also called "TR1 refers to a G-protein coupled receptor is 
specifically expressed in taste receptor cells such as foliate, fungiform, and circumvallate 
cells (see, e.g., Hoon et al, Cell 96:541-551 (1999), herein incorporated by reference in 
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its entirety). Such taste cells can be identified because they express specific molecules 
such as Gustducin, a taste cell specific G protein (McLaughin et al, Nature 357:563-569 
(1992)). Taste receptor cells can also be identified on the basis of morphology (see, e.g., 
Roper, supra). 

5 GPCR-B3 encodes GPCRs with seven transmembrane regions that have 

"G-protein coupled receptor activity," e.g., they bind to G-proteins in response to 
extracellular stimuli and promote production of second messengers such as IP 3, cAMP, 
and Ca 2+ via stimulation of enzymes such as phospholipase C and adenylate cyclase (for a 
description of the structure and function of GPCRs, see, e.g., Fong, supra, and Baldwin, 
10 supra). 

The term GPCR-B3 therefore refers to polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have about 70% amino acid sequence 
identity, optionally about 75, 80, 85, 90, or 95% amino acid sequence identity to SEQ ID 
NOS:l-3 over a window of about 25 amino acids, optionally 50-100 amino acids; (2) bind 

1 5 to antibodies raised against an immunogen comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-3 and conservatively modified variants 
thereof; (3) specifically hybridize (with a size of at least about 500, optionally at least 
about 900 nucleotides) under stringent hybridization conditions to a sequence selected 
from the group consisting of SEQ ID NO:4-6, and conservatively modified variants 

20 thereof; or (4) are amplified by primers that specifically hybridize under stringent 

hybridization conditions to the same sequence as a degenerate primer sets encoding SEQ 
IDNOS:7-8. 

Topologically, sensory GPCRs have an N-terminal "extracellular domain," 
a "transmembrane domain" comprising seven transmembrane regions and corresponding 

25 cytoplasmic and extracellular loops, and a C-terminal "cytoplasmic domain" (see Figure 
1; see also Hoon et al, Cell 96:541-551 (1999); Buck & Axel, Cell 65:175-187 (1991)). 
These domains can be structurally identified using methods known to those of skill in the 
art, such as sequence analysis programs that identify hydrophobic and hydrophilic 
domains (see, e.g., Kyte & Doolittle, J. Mol Biol. 157:105-132 (1982)). Such domains 

30 are useful for making chimeric proteins and for in vitro assays of the invention. 

"Extracellular domain" therefore refers to the domain of GPCR-B3 that 
protrudes from the cellular membrane and binds to extracellular ligand. This region starts 
at the N-terminus and ends approximately at the conserved glutamic acid at amino acid 
position 563 plus or minus approximately 20 amino acids. The region corresponding to 
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amino acids 1-580 of SEQ ID NO:l (nucleotides 1-1740, with nucleotide 1 starting at the 
ATG initiator methionine codon; see also Figure 1) is one embodiment of an extracellular 
domain that extends slightly into the transmembrane domain. This embodiment is useful 
for in vitro ligand binding assays, both soluble and solid phase. 
5 "Transmembrane domain," comprising seven transmembrane regions plus 

the corresponding cytoplasmic and extracellular loops, refers to the domain of GPCR-B3 
that starts approximately at the conserved glutamic acid residue at amino acid position 
563 plus or minus approximately 20 amino acids and ends approximately at the conserved 
tyrosine amino acid residue at position 812 plus or minus approximately 10 amino acids. 

10 "Cytoplasmic domain" refers to the domain of GPCR-B3 that starts at the 

conserved tyrosine amino acid residue at position 812 plus or minus approximately 10 
amino acids and continues to the C-terminus of the polypeptide. 

"Biological sample" as used herein is a sample of biological tissue or fluid 
that contains GPCR-B3 or nucleic acid encoding GPCR-B3 protein. Such samples 

1 5 include, but are not limited to, tissue isolated from humans, mice, and rats, in particular, 
ton. Biological samples may also include sections of tissues such as frozen sections taken 
for histological purposes. A biological sample is typically obtained from a eukaryotic 
organism, such as insects, protozoa, birds, fish, reptiles, and preferably a mammal such as 
rat, mouse, cow, dog, guinea pig, or rabbit, and most preferably a primate such as 

20 chimpanzees or humans. Tissues include tongue tissue, isolated taste buds, and testis 
tissue. 

"GPCR activity" refers to the ability of a GPCR to transduce a signal. 
Such activity can be measured in a heterologous cell, by coupling a GPCR (or a chimeric 
GPCR) to either a G-protein or promiscuous G-protein such as Gal 5, and an enzyme 

25 such as PLC, and measuring increases in intracellular calcium using (Offermans & 
Simon, J. Biol. Chem. 270:15175-15180 (1995)). Receptor activity can be effectively 
measured by recording ligand-induced changes in [Ca 2+ ]i using fluorescent Ca 2+ -indicator 
dyes and fluorometric imaging. Optionally, the polypeptides of the invention are 
involved in sensory transduction, optionally taste transduction in taste cells. 

30 The phrase "functional effects" in the context of assays for testing 

compounds that modulate GPCR-B3 mediated taste transduction includes the 
determination of any parameter that is indirectly or directly under the influence of the 
receptor, e.g., functional, physical and chemical effects. It includes ligand binding, 
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changes in ion flux, membrane potential, current flow, transcription, G-protein binding, 
GPCR phosphorylation or dephosphorylation, signal transduction, receptor-ligand 
interactions, second messenger concentrations (e.g., cAMP, EP3, or intracellular Ca 2+ ), in 
vitro, in vivo, and ex vivo and also includes other physiologic effects such increases or 
5 decreases of neurotransmitter or hormone release. 

By "determining the functional effect" is meant assays for a compound 
that increases or decreases a parameter that is indirectly or directly under the influence of 
GPCR-B3, e.g., functional, physical and chemical effects. Such functional effects can be 
measured by any means known to those skilled in the art, e.g., changes in spectroscopic 

10 characteristics (e.g., fluorescence, absorbance, refractive index), hydrodynamic (e.g., 

shape), chromatographic, or solubility properties, patch clamping, voltage-sensitive dyes, 
whole cell currents, radioisotope efflux, inducible markers, oocyte GPCR-B3 expression; 
tissue culture cell GPCR-B3 expression; transcriptional activation of GPCR-B3; ligand 
binding assays; voltage, membrane potential and conductance changes; ion flux assays; 

15 changes in intracellular second messengers such as cAMP and inositol triphosphate (IP3); 
changes in intracellular calcium levels; neurotransmitter release, and the like. 

"Inhibitors," "activators," and "modulators" of GPCR-B3 are used 
interchangeably to refer to inhibitory, activating, or modulating molecules identified 
using in vitro and in vivo assays for taste transduction, e.g., ligands, agonists, antagonists, 

20 and their homologs and mimetics. Inhibitors are compounds that, e.g., bind to, partially or 
totally block stimulation, decrease, prevent, delay activation, inactivate, desensitize, or 
down regulate taste transduction, e.g., antagonists. Activators are compounds that, e.g., 
bind to, stimulate, increase, open, activate, facilitate, enhance activation, sensitize or up 
regulate taste transduction, e.g., agonists. Modulators include compounds that, e.g., alter 

25 the interaction of a receptor with: extracellular proteins that bind activators or inhibitor 
(e.g., ebnerin and other members of the hydrophobic carrier family); G -proteins; kinases 
(e.g., homologs of rhodopsin kinase and beta adrenergic receptor kinases that are 
involved in deactivation and desensitization of a receptor); and arrestin-like proteins, 
which also deactivate and desensitize receptors. Modulators include genetically modified 

30 versions of GPCR-B3, e.g., with altered activity, as well as naturally occurring and 
synthetic ligands, antagonists, agonists, small chemical molecules and the like. Such 
assays for inhibitors and activators include, e.g., expressing GPCR-B3 in cells or cell 
membranes, applying putative modulator compounds, and then determining the functional 
effects on taste transduction, as described above. Samples or assays comprising GPCR- 
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B3 that are treated with a potential activator, inhibitor, or modulator are compared to 
control samples without the inhibitor, activator, or modulator to examine the extent of 
inhibition. Control samples (untreated with inhibitors) are assigned a relative GPCR-B3 
activity value of 100%. Inhibition of GPCR-B3 is achieved when the GPCR-B3 activity 
5 value relative to the control is about 80%, optionally 50%, 25-0%. Activation of GPCR- 
B3 is achieved when the GPCR-B3 activity value relative to the control is 110%, 
optionally 1 50%, 200-500%, 1 000-3000% higher. 

"Biologically active" GPCR-B3 refers to GPCR-B3 having GPCR activity 
as described above, involved in taste transduction in taste receptor cells. 

10 The terms "isolated" "purified" or "biologically pure" refer to material that 

is substantially or essentially free from components which normally accompany it as 
found in its native state. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
performance liquid chromatography. A protein that is the predominant species present in 

1 5 a preparation is substantially purified. In particular, an isolated GPCR-B3 nucleic acid is 
separated from open reading frames that flank the GPCR-B3 gene and encode proteins 
other than GPCR-B3. The term "purified" denotes that a nucleic acid or protein gives rise 
to essentially one band in an electrophoretic gel. Particularly, it means that the nucleic 
acid or protein is at least 85% pure, optionally at least 95% pure, and optionally at least 

20 99% pure. 

"Nucleic acid" refers to deoxyribonucleotides or ribonucleotides and 
polymers thereof in either single- or double-stranded form. The term encompasses 
nucleic acids containing known nucleotide analogs or modified backbone residues or 
linkages, which are synthetic, naturally occurring, and non-naturally occurring, which 

25 have similar binding properties as the reference nucleic acid, and which are metabolized 
in a manner similar to the reference nucleotides. Examples of such analogs include, 
without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral- 
methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs). 

Unless otherwise indicated, a particular nucleic acid sequence also 

30 implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon 
substitutions) and complementary sequences, as well as the sequence explicitly indicated. 
Specifically, degenerate codon substitutions may be achieved by generating sequences in 
which the third position of one or more selected (or all) codons is substituted with mixed- 
base and/or deoxyinosine residues (Batzer et ai y Nucleic Acid Res. 19:5081 (1991); 
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Ohtsuka et al, J. Biol Chem. 260:2605-2608 (1985); Rossolini et ai 9 Mol Cell Probes 
8:91-98 (1994)). The term nucleic acid is used interchangeably with gene, cDNA, 
mRNA, oligonucleotide, and polynucleotide. 

The terms ''polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an artificial chemical mimetic of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino 
acid polymers and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino 
acids, as well as amino acid analogs and amino acid mimetics that function in a manner 
similar to the naturally occurring amino acids. Naturally occurring amino acids are those 
encoded by the genetic code, as well as those amino acids that are later modified, e.g., 
hydroxyproline, y-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to 
compounds that have the same basic chemical structure as a naturally occurring amino 
acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and 
an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl 
sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide 
backbones, but retain the same basic chemical structure as a naturally occurring amino 
acid. Amino acid mimetics refers to chemical compounds that have a structure that is 
different from the general chemical structure of an amino acid, but that functions in a 
manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known 
three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB 
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by 
their commonly accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively 
modified variants refers to those nucleic acids which encode identical or essentially 
identical amino acid sequences, or where the nucleic acid does not encode an amino acid 
sequence, to essentially identical sequences. Because of the degeneracy of the genetic 
code, a large number of functionally identical nucleic acids encode any given protein. 
For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. 
Thus, at every position where an alanine is specified by a codon, the codon can be altered 
to any of the corresponding codons described without altering the encoded polypeptide. 
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Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
polypeptide also describes every possible silent variation of the nucleic acid. One of skill 
will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the 
5 only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) 
can be modified to yield a functionally identical molecule. Accordingly, each silent 
variation of a nucleic acid which encodes a polypeptide is implicit in each described 
sequence. 

As to amino acid sequences, one of skill will recognize that individual 
1 0 substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 

sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well 
15 known in the art. Such conservatively modified variants are in addition to and do not 
exclude polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another: 



1) 


Alanine (A), Glycine (G); 


20 2) 


Aspartic acid (D), Glutamic acid (E); 


3) 


Asparagine (N), Glutamine (Q); 


4) 


Arginine (R), Lysine (K); 


5) 


Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 


6) 


Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 


25 7) 


Serine (S), Threonine (T); and 


8) 


Cysteine (C), Methionine (M) 



(see, e.g., Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described 
in terms of various levels of organization. For a general discussion of this organization, 
30 see, e.g., Alberts et al. y Molecular Biology of the Cell (3 rd ed., 1994) and Cantor and 
Schimmel, Biophysical Chemistry Part I: The Conformation of Biological 
Macromolecules ( 1 980). "Primary structure" refers to the amino acid sequence of a 
particular peptide. "Secondary structure" refers to locally ordered, three dimensional 
structures within a polypeptide. These structures are commonly known as domains. 
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Domains are portions of a polypeptide that form a compact unit of the polypeptide and 
are typically 50 to 350 amino acids long. Typical domains are made up of sections of 
lesser organization such as stretches of p-sheet and a-helices. "Tertiary structure" refers 
to the complete three dimensional structure of a polypeptide monomer. "Quaternary 
5 structure" refers to the three dimensional structure formed by the noncovalent association 
of independent tertiary units.. Anisotropic terms are also known as energy terms. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For 
example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, enzymes 

10 (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins for 
which ant or 7 can be made detectable, e.g., by incorporating a radiolabel into the peptide, 
and used to detect antibodies specifically reactive with the peptide). 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, 
either covalently, through a linker or a chemical bond, or noncovalently, through ionic, 

1 5 van der Waals, electrostatic, or hydrogen bonds to a label such that the presence of the 
probe may be detected by detecting the presence of the label bound to the probe. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence 
through one or more types of chemical bonds, usually through complementary base 

20 pairing, usually through hydrogen bond formation. As used herein, a probe may include 
natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In 
addition, the bases in a probe may be joined by a linkage other than a phosphodi ester 
bond, so long as it does not interfere with hybridization. Thus, for example, probes may 
be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather 

25 than phosphodiester linkages. It will be understood by one of skill in the art that probes 
may bind target sequences lacking complete complementarity with the probe sequence 
depending upon the stringency of the hybridization conditions. The probes are optionally 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which a streptavidin complex may later bind. By assaying 

30 for the presence or absence of the probe, one can detect the presence or absence of the 
select sequence or subsequence. 

The term "recombinant" when used with reference, e.g., to a cell, or 
nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has 



18 



WO 00/06592 PCT/US99/17099 

been modified by the introduction of a heterologous nucleic acid or protein or the 
alteration of a native nucleic acid or protein, or that the cell is derived from a cell so 
modified. Thus, for example, recombinant cells express genes that are not found within 
the native (non-recombinant) form of the cell or express native genes that are otherwise 
5 abnormally expressed, under expressed or not expressed at all. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences from unrelated genes 

10 arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship to 
each other in nature (e.g., a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 

15 direct transcription of a nucleic acid. As used herein, a promoter includes necessary 
nucleic acid sequences near the start site of transcription, such as, in the case of a 
polymerase II type promoter, a TATA element. A promoter also optionally includes 
distal enhancer or repressor elements, which can be located as much as several thousand 
base pairs from the start site of transcription. A "constitutive" promoter is a promoter that 

20 is active under most environmental and developmental conditions. An "inducible" 

promoter is a promoter that is active under environmental or developmental regulation. 
The term "operably linked" refers to a functional linkage between a nucleic acid 
expression control sequence (such as a promoter, or array of transcription factor binding 
sites) and a second nucleic acid sequence, wherein the expression control sequence 

25 directs transcription of the nucleic acid corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated 
recombinantly or synthetically, with a series of specified nucleic acid elements that 
permit transcription of a particular nucleic acid in a host cell. The expression vector can 
be part of a plasmid, virus, or nucleic acid fragment. Typically, the expression vector 

30 includes a nucleic acid to be transcribed operably linked to a promoter. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences 
that are the same or have a specified percentage of amino acid residues or nucleotides that 
are the same (i.e., 70% identity, optionally 75%, 80%, 85%, 90%, or 95% identity over a 
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specified region), when compared and aligned for maximum correspondence over a 
comparison window, or designated region as measured using one of the following 
sequence comparison algorithms or by manual alignment and visual inspection. Such 
sequences are then said to be "substantially identical." This definition also refers to the 
5 compliment of a test sequence. Optionally, the identity exists over a region that is at least 
about 50 amino acids or nucleotides in length, or more preferably over a region that is 75- 
100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference 
sequence, to which test sequences are compared. When using a sequence comparison 

10 algorithm, test and reference sequences are entered into a computer, subsequence 

coordinates are designated, if necessary, and sequence algorithm program parameters are 
designated. Default program parameters can be used, or alternative parameters can be 
designated. The sequence comparison algorithm then calculates the percent sequence 
identities for the test sequences relative to the reference sequence, based on the program 

15 parameters. 

A "comparison window", as used herein, includes reference to a segment 
of any one of the number of contiguous positions selected from the group consisting of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in 
which a sequence may be compared to a reference sequence of the same number of 

20 contiguous positions after the two sequences are optimally aligned. Methods of 

alignment of sequences for comparison are well-known in the art. Optimal alignment of 
sequences for comparison can be conducted, e.g., by the local homology algorithm of 
Smith & Waterman, Adv. Appl Math. 2:482 (1981), by the homology alignment 
algorithm of Needleman & Wunsch, J. Mol Biol 48:443 (1970), by the search for 

25 similarity method of Pearson & Lipman, Proc. Natl Acad. ScL USA 85:2444 (1988), by 
computerized implementations of these algorithms (GAP, BESTFIT, FAST A, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
Science Dr., Madison, WI), or by manual alignment and visual inspection (see, e.g., 
Current Protocols in Molecular Biology (Ausubel et al. y eds. 1995 supplement)). 

30 One example of a useful algorithm is PILEUP. PILEUP creates a multiple 

sequence alignment from a group of related sequences using progressive, pairwise 
alignments to show relationship and percent sequence identity. It also plots a tree or 
dendogram showing the clustering relationships used to create the alignment. PILEUP 
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol 
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Evol 35:351-360 (1987). The method used is similar to the method described by Higgins 
& Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences, each 
of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment 
procedure begins with the pairwise alignment of the two most similar sequences, 
5 producing a cluster of two aligned sequences. This cluster is then aligned to the next 
most related sequence or cluster of aligned sequences. Two clusters of sequences are 
aligned by a simple extension of the pairwise alignment of two individual sequences. The 
final alignment is achieved by a series of progressive, pairwise alignments. The program 
is run by designating specific sequences and their amino acid or nucleotide coordinates 

10 for regions of sequence comparison and by designating the program parameters. Using 
PILEUP, a reference sequence is compared to other test sequences to determine the 
percent sequence identity relationship using the following parameters: default gap weight 
(3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained 
from the GCG sequence analysis software package, e.g., version 7.0 (Devereaux et al, 

15 Nuc Acids Res. 12:387-395 (1984). 

Another example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul 
et al, 1 Mol Biol 215:403-410 (1990), respectively. Software for performing BLAST 

20 analyses is publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence, 
which either match or satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as the neighborhood 

25 word score threshold (Altschul et al, supra). These initial neighborhood word hits act as 
seeds for initiating searches to find longer HSPs containing them. The word hits are 
extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Cumulative scores are calculated using, for nucleotide sequences, 
the parameters M (reward score for a pair of matching residues; always > 0) and N 

30 (penalty score for mismatching residues; always < 0). For amino acid sequences, a 
scoring matrix is used to calculate the cumulative score. Extension of the word hits in 
each direction are halted when: the cumulative alignment score falls off by the quantity X 
from its maximum achieved value; the cumulative score goes to zero or below, due to the 
accumulation of one or more negative-scoring residue alignments; or the end of either 
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sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) or 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
5 defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)) alignments (B) 
of 50, expectation (E) of 1 0, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Natl Acad. Sci. USA 

10 90:5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is 
the smallest sum probability (P(N)), which provides an indication of the probability by 
which a match between two nucleotide or amino acid sequences would occur by chance. 
For example, a nucleic acid is considered similar to a reference sequence if the smallest 
sum probability in a comparison of the test nucleic acid to the reference nucleic acid is 

1 5 less than about 0.2, more preferably less than about 0.01 , and most preferably less than 
about 6.001. 

An indication that two nucleic acid sequences or polypeptides are 
substantially identical is that the polypeptide encoded by the first nucleic acid is 
immunologically cross reactive with the antibodies raised against the polypeptide 

20 encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically 
substantially identical to a second polypeptide, for example, where the two peptides differ 
only by conservative substitutions. Another indication that two nucleic acid sequences 
are substantially identical is that the two molecules or their complements hybridize to 
each other under stringent conditions, as described below. Yet another indication that 

25 two nucleic acid sequences are substantially identical is that the same primers can be used 
to amplify the sequence. 

The phrase "selectively (or specifically) hybridizes to" refers to the 
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence 
under stringent hybridization conditions when that sequence is present in a complex 

30 mixture (e.g., total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acid, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
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higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 
5 thermal melting point (T m ) for the specific sequence at a defined ionic strength pH. The 
T m is the temperature (under defined ionic strength, pH, and nucleic concentration) at 
which 50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt 

10 concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium 
ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 
30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes 
(e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the 
addition of destabilizing agents such as formamide. For selective or specific 

15 hybridization, a positive signal is at least two times background, optionally 10 times 
background hybridization. Exemplary stringent hybridization conditions can be as 
following: 50% formamide, 5x SSC, and 1% SDS, incubating at 42°C, or, 5x SSC, 1% 
SDS, incubating at 65°C, with wash in 0.2x SSC, and 0.1% SDS at 65°C. 

Nucleic acids that do not hybridize to each other under stringent conditions 

20 are still substantially identical if the polypeptides which they encode are substantially 
identical. This occurs, for example, when a copy of a nucleic acid is created using the 
maximum codon degeneracy permitted by the genetic code. In such cases, the nucleic 
acids typically hybridize under moderately stringent hybridization conditions. Exemplary 
"moderately stringent hybridization conditions" include a hybridization in a buffer of 

25 40% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive 
hybridization is at least twice background. Those of ordinary skill will readily recognize 
that alternative hybridization and wash conditions can be utilized to provide conditions of 
similar stringency. 

"Antibody" refers to a polypeptide comprising a framework region from 

30 an immunoglobulin gene or fragments thereof that specifically binds and recognizes an 
antigen. The recognized immunoglobulin genes include the kappa, lambda, alpha, 
gamma, delta, epsilon, and mu constant region genes, as well as the myriad 
immunoglobulin variable region genes. Light chains are classified as either kappa or 
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lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn 
define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each 
5 pair having one "light" (about 25 kDa) and one "heavy" chain (about 50-70 kDa). The 
N-terminus of each chain defines a variable region of about 100 to 1 10 or more amino 
acids primarily responsible for antigen recognition. The terms variable light chain (Vl) 
and variable heavy chain (Vh) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 

10 characterized fragments produced by digestion with various peptidases. Thus, for 

example, pepsin digests an antibody below the disulfide linkages in the hinge region to 
produce F(ab)'2, a dimer of Fab which itself is a light chain joined to V H -C H 1 by a 
disulfide bond. The F(ab)' 2 may be reduced under mild conditions to break the disulfide 
linkage in the hinge region, thereby converting the F(ab)'2 dimer into an Fab* monomer. 

1 5 The Fab' monomer is essentially Fab with part of the hinge region (see Fundamental 
Immunology (Paul ed., 3d ed. 1993). While various antibody fragments are defined in 
terms of the digestion of an intact antibody, one of skill will appreciate that such 
fragments may be synthesized de novo either chemically or by using recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 

20 either produced by the modification of whole antibodies, or those synthesized de novo 
using recombinant DNA methodologies (e.g., single chain Fv) or those identified using 
phage display libraries {see, e.g., McCafferty et al, Nature 348:552-554 (1990)). 

For preparation of monoclonal or polyclonal antibodies, any technique 
known in the art can be used (see, e.g., Kohler & Milstein, Nature 256:495-497 (1975); 

25 Kozbor et al. , Immunology Today 4: 72 (1 983); Cole et al. , pp. 77-96 in Monoclonal 
Antibodies and Cancer Therapy (1985)). Techniques for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides 
of this invention. Also, transgenic mice, or other organisms such as other mammals, may 
be used to express humanized antibodies. Alternatively, phage display technology can be 

30 used to identify antibodies and heteromeric Fab fragments that specifically bind to 

selected antigens (see, e.g., McCafferty et al, Nature 348:552-554 (1990); Marks et al, 
Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding 
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site (variable region) is linked to a constant region of a different or altered class, effector 
function and/or species, or an entirely different molecule which confers new properties to 
the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) 
the variable region, or a portion thereof, is altered, replaced or exchanged with a variable 
5 region having a different or altered antigen specificity. 

An "anti-GPCR-B3" antibody is an antibody or antibody fragment that 
specifically binds a polypeptide encoded by the GPCR-B3 gene, cDNA, or a subsequence 
thereof. 

The term "immunoassay" is an assay that uses an antibody to specifically 

10 bind an antigen. The immunoassay is characterized by the use of specific binding 
properties of a particular antibody to isolate, target, and/or quantify the antigen. 

The phrase "specifically (or selectively) binds" to an antibody or 
"specifically (or selectively) immunoreactive with," when referring to a protein or 
peptide, refers to a binding reaction that is determinative of the presence of the protein in 

15 a heterogeneous population of proteins and other biologies. Thus, under designated 

immunoassay conditions, the specified antibodies bind to a particular protein at least two 
times the background and do not substantially bind in a significant amount to other 
proteins present in the sample. Specific binding to an antibody under such conditions 
may require an antibody that is selected for its specificity for a particular protein. For 

20 example, polyclonal antibodies raised to GPCR-B3 from specific species such as rat, 
mouse, or human can be selected to obtain only those polyclonal antibodies that are 
specifically immunoreactive with GPCR-B3 and not with other proteins, except for 
polymorphic variants and alleles of GPCR-B3. This selection may be achieved by 
subtracting out antibodies that cross-react with GPCR-B3 molecules from other species. 

25 A variety of immunoassay formats may be used to select antibodies specifically 
immunoreactive with a particular protein. For example, solid-phase ELISA 
immunoassays are routinely used to select antibodies specifically immunoreactive with a 
protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory Manual (1988), for a 
description of immunoassay formats and conditions that can be used to determine specific 

30 immunoreactivity). Typically a specific or selective reaction will be at least twice 

background signal or noise and more typically more than 10 to 100 times background. 

The phrase "selectively associates with" refers to the ability of a nucleic 
acid to "selectively hybridize" with another as defined above, or the ability of an antibody 
to "selectively (or specifically) bind to a protein, as defined above. 
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By "host cell" is meant a cell that contains an expression vector and 
supports the replication or expression of the expression vector. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa and the like, e.g., cultured cells, explants, and cells 
5 in vivo, 

III. Isolation of the nucleic acid encoding GPCR-B3 

A. General recombinant DNA methods 

This invention relies on routine techniques in the field of recombinant 

10 genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al, Molecular Cloning, A Laboratory Manual (2nd ed. 1989); Kriegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in 
Molecular Biology (Ausubel et al. 9 eds., 1994)). 

For nucleic acids, sizes are given in either kilobases (kb) or base pairs 

15 (bp). These are estimates derived from agarose or acrylamide gel electrophoresis, from 
sequenced nucleic acids, or from published DNA sequences. For proteins, sizes are given 
in kilodaltons (kDa) or amino acid residue numbers. Proteins sizes are estimated from gel 
electrophoresis, from sequenced proteins, from derived amino acid sequences, or from 
published protein sequences. 

20 Oligonucleotides that are not commercially available can be chemically 

synthesized according to the solid phase phosphoramidite triester method first described 
by Beaucage & Caruthers, Tetrahedron Letts, 22:1859-1862 (1981), using an automated 
synthesizer, as described in Van Devanter et. aL, Nucleic Acids Res. 12:6159-6168 
(1984). Purification of oligonucleotides is by either native acrylamide gel electrophoresis 

25 or by anion-exchange HPLC as described in Pearson & Reanier, J. Chrom. 255 : 1 37-149 
(1983). 

The sequence of the cloned genes and synthetic oligonucleotides can be 
verified after cloning using, e.g., the chain termination method for sequencing double- 
stranded templates of Wallace et al., Gene 16:21-26 (1981). 

30 

B. Cloning methods for the isolation of nucleotide sequences encoding 

GPCR-B3 

In general, the nucleic acid sequences encoding GPCR-B3 and related 
nucleic acid sequence homologs are cloned from cDNA and genomic DNA libraries by 
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hybridization with a probe, or isolated using amplification techniques with 
oligonucleotide primers. For example, GPCR-B3 sequences are typically isolated from 
mammalian nucleic acid (genomic or cDNA) libraries by hybridizing with a nucleic acid 
probe, the sequence of which can be derived from SEQ ID NOS:4-6. A suitable tissue 
5 from which GPCR-B3 RNA and cDNA can be isolated is tongue tissue, optionally taste 
bud tissue or individual taste cells. 

Amplification techniques using primers can also be used to amplify and 
isolate GPCR-B3 from DNA or RNA. The degenerate primers encoding the following 
amino acid sequences can also be used to amplify a sequence of GPCR-B3: SEQ ID 

10 NOS:7-8 (see, e.g., Dieffenfach & Dveksler, PCR Primer: A Laboratory Manual (1995)). 
These primers can be used, e.g., to amplify either the full length sequence or a probe of 
one to several hundred nucleotides, which is then used to screen a mammalian library for 
full-length GPCR-B3. 

Nucleic acids encoding GPCR-B3 can also be isolated from expression 

1 5 libraries using antibodies as probes. Such polyclonal or monoclonal antibodies can be 
raised using the sequence of SEQ ID NOS: 1-3. 

GPCR-B3 polymorphic variants, alleles, and interspecies homologs that 
are substantially identical to GPCR-B3 can be isolated using GPCR-B3 nucleic acid 
probes, and oligonucleotides under stringent hybridization conditions, by screening 

20 libraries. Alternatively, expression libraries can be used to clone GPCR-B3 and GPCR- 
B3 polymorphic variants, alleles, and interspecies homologs, by detecting expressed 
homologs immunologically with antisera or purified antibodies made against GPCR-B3, 
which also recognize and selectively bind to the GPCR-B3 homolog. 

To make a cDNA library, one should choose a source that is rich in 

25 GPCR-B3 mRNA, e.g., tongue tissue, or isolated taste buds. The mRNA is then made 
into cDNA using reverse transcriptase, ligated into a recombinant vector, and transfected 
into a recombinant host for propagation, screening and cloning. Methods for making and 
screening cDNA libraries are well known (see, e.g., Gubler & Hoffman, Gene 25:263-269 
(1983); Sambrook et at, supra; Ausubel et al, supra). 

30 For a genomic library, the DNA is extracted from the tissue and either 

mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. 
The fragments are then separated by gradient centrifugation from undesired sizes and are 
constructed in bacteriophage lambda vectors. These vectors and phage are packaged in 
vitro. Recombinant phage are analyzed by plaque hybridization as described in Benton & 
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Davis, Science 196:180-182 (1977). Colony hybridization is carried out as generally 
described in Grunstein et al, Proc. Natl. Acad. Sci. USA., 72:3961-3965 (1975). 

An alternative method of isolating GPCR-B3 nucleic acid and its 
homologs combines the use of synthetic oligonucleotide primers and amplification of an 
5 RNA or DNA template (see U.S. Patents 4,683,195 and 4,683,202; PCR Protocols: A 
Guide to Methods and Applications (Innis et al., eds, 1990)). Methods such as 
polymerase chain reaction (PCR) and ligase chain reaction (LCR) can be used to amplify 
nucleic acid sequences of GPCR-B3 directly from mRNA, from cDNA, from genomic 
libraries or cDNA libraries. Degenerate oligonucleotides can be designed to amplify 

10 GPCR-B3 homologs using the sequences provided herein. Restriction endonuclease sites 
can be incorporated into the primers. Polymerase chain reaction or other in vitro 
amplification methods may also be useful, for example, to clone nucleic acid sequences 
that code for proteins to be expressed, to make nucleic acids to use as probes for detecting 
the presence of GPCR-B3 encoding mRNA in physiological samples, for nucleic acid 

1 5 sequencing, or for other purposes. Genes amplified by the PCR reaction can be purified 
from agarose gels and cloned into an appropriate vector. 

Gene expression of GPCR-B3 can also be analyzed by techniques known 
in the art, e.g., reverse transcription and amplification of mRNA, isolation of total RNA 
or poly A + RNA, northern blotting, dot blotting, in situ hybridization, RNase protection, 

20 probing DNA microchip arrays, and the like. In one embodiment, high density 

oligonucleotide analysis technology (e.g., GeneChip™) is used to identify homologs and 
polymorphic variants of the GPCRs of the invention. In the case where the homologs 
being identified are linked to a known disease, they can be used with GeneChip™ as a 
diagnostic tool in detecting the disease in a biological sample, see, e.g., Gunthand et al, 

25 AIDS Res. Hum. Retroviruses 14: 869-876 (1998); Kozal et al, Nat Med. 2:753-759 
(1996); Matson etal, Anal Biochem. 224:1 10-106 (1995); Lockhart et al, Nat. 
Biotechnol 14:1675-1680 (1996); Gingeras et al, Genome Res. 8:435-448 (1998); Hacia 
et al, Nucleic Acids Res. 26:3865-3866 (1 998). 

Synthetic oligonucleotides can be used to construct recombinant GPCR-B3 

30 genes for use as probes or for expression of protein. This method is performed using a 
series of overlapping oligonucleotides usually 40-120 bp in length, representing both the 
sense and non-sense strands of the gene. These DNA fragments are then annealed, 
ligated and cloned. Alternatively, amplification techniques can be used with precise 
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primers to amplify a specific subsequence of the GPCR-B3 nucleic acid. The specific 
subsequence is then ligated into an expression vector. 

The nucleic acid encoding GPCR-B3 is typically cloned into intermediate 
vectors before transformation into prokaryotic or eukaryotic cells for replication and/or 
5 expression. These intermediate vectors are typically prokaryote vectors, e.g., plasmids, or 
shuttle vectors. 

Optionally, nucleic acids encoding chimeric proteins comprising GPCR- 
B3 or domains thereof can be made according to standard techniques. For example, a 
domain such as ligand binding domain, an extracellular domain, a transmembrane domain 

10 (e.g., one comprising seven transmembrane regions and cytosolic loops), the 

transmembrane domain and a cytoplasmic domain, an active site, a subunit association 
region, etc., can be covalently linked to a heterologous protein. For example, an 
extracellular domain can be linked to a heterologous GPCR transmembrane domain, or a 
heterologous GPCR extracellular domain can be linked to a transmembrane domain. 

15 Other heterologous proteins of choice include, e.g., green fluorescent protein, p-gal, 
glutamate receptor, and the rhodopsin presequence. 

C. Expression in prokaryotes and eukaryotes 

To obtain high level expression of a cloned gene or nucleic acid, such as 
20 those cDNAs encoding GPCR-B3, one typically subclones GPCR-B3 into an expression 
vector that contains a strong promoter to direct transcription, a transcription/translation 
terminator, and if for a nucleic acid encoding a protein, a ribosome binding site for 
translational initiation. Suitable bacterial promoters are well known in the art and 
described, e.g., in Sambrook et ah and Ausubel et ah Bacterial expression systems for 
25 expressing the GPCR-B3 protein are available in, e.g., E. coli, Bacillus sp. t and 

Salmonella (Palva et ah, Gene 22:229-235 (1983); Mosbach et ah, Nature 302:543-545 
(1983). Kits for such expression systems are commercially available. Eukaryotic 
expression systems for mammalian cells, yeast, and insect cells are well known in the art 
and are also commercially available. In one embodiment, the eukaryotic expression 
30 vector is an adenoviral vector, an adeno-associated vector, or a retroviral vector. 

The promoter used to direct expression of a heterologous nucleic acid 
depends on the particular application. The promoter is optionally positioned about the 
same distance from the heterologous transcription start site as it is from the transcription 
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start site in its natural setting. As is known in the art, however, some variation in this 
distance can be accommodated without loss of promoter function. 

In addition to the promoter, the expression vector typically contains a 
transcription unit or expression cassette that contains all the additional elements required 
5 for the expression of the GPCR-B3 encoding nucleic acid in host cells. A typical 

expression cassette thus contains a promoter operably linked to the nucleic acid sequence 
encoding GPCR-B3 and signals required for efficient polyadenylation of the transcript, 
ribosome binding sites, and translation termination. The nucleic acid sequence encoding 
GPCR-B3 may typically be linked to a cleavable signal peptide sequence to promote 

10 secretion of the encoded protein by the transformed cell. Such signal peptides would 

include, among others, the signal peptides from tissue plasminogen activator, insulin, and 
neuron growth factor, and juvenile hormone esterase of Heliothis virescens. Additional 
elements of the cassette may include enhancers and, if genomic DNA is used as the 
structural gene, introns with functional splice donor and acceptor sites. 

15 In addition to a promoter sequence, the expression cassette should also 

contain a transcription termination region downstream of the structural gene to provide 
for efficient termination. The termination region may be obtained from the same gene as 
the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 

20 into the cell is not particularly critical. Any of the conventional vectors used for 

expression in eukaryotic or prokaryotic cells may be used. Standard bacterial expression 
vectors include plasmids such as pBR322 based plasmids, pSKF, pET23D, and fusion 
expression systems such as GST and LacZ. Epitope tags can also be added to 
recombinant proteins to provide convenient methods of isolation, e.g., c-myc. 

25 Expression vectors containing regulatory elements from eukaryotic viruses 

are typically used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus 
vectors, and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic 
vectors include pMSG, pAV009/A + , pMTO10/A + , pMAMneo-5, baculovirus pDSVE, 
and any other vector allowing expression of proteins under the direction of the SV40 

30 early promoter, SV40 later promoter, metallothionein promoter, murine mammary tumor 
virus promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters 
shown effective for expression in eukaryotic cells. 

Some expression systems have markers that provide gene amplification 
such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase. 
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Alternatively, high yield expression systems not involving gene amplification are also 
suitable, such as using a baculovirus vector in insect cells, with a GPCR-B3 encoding 
sequence under the direction of the polyhedrin promoter or other strong baculovirus 
promoters. 

5 The elements that are typically included in expression vectors also include 

a replicon that functions in E. coli, a gene encoding antibiotic resistance to permit 
selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
nonessential regions of the plasmid to allow insertion of eukaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical, any of the many resistance 

10 genes known in the art are suitable. The prokaryotic sequences are typically chosen such 
that they do not interfere with the replication of the DNA in eukaryotic cells, if necessary. 

Standard transfection methods are used to produce bacterial, mammalian, 
yeast or insect cell lines that express large quantities of GPCR-B3 protein, which are then 
purified using standard techniques (see, e.g., Colley et al, J. Biol Chem. 264:17619- 

1 5 1 7622 ( 1 989); Guide to Protein Purification, in Methods in Enzymology, vol. 1 82 
(Deutscher, ed., 1990)). Transformation of eukaryotic and prokaryotic cells are 
performed according to standard techniques (see, e.g., Morrison, J. Bact. 132:349-351 
(1977); Clark-Curtiss & Curtiss, Methods in Enzymology 101:347-362 (Wu et al. y eds, 
1983). 

20 Any of the well known procedures for introducing foreign nucleotide 

sequences into host cells may be used. These include the use of calcium phosphate 
transfection, polybrene, protoplast fusion, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing 
cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host 

25 cell (see f e.g., Sambrook et al. t supra). It is only necessary that the particular genetic 
engineering procedure used be capable of successfully introducing at least one gene into 
the host cell capable of expressing GPCR-B3. 

After the expression vector is introduced into the cells, the transfected cells 
are cultured under conditions favoring expression of GPCR-B3, which is recovered from 

30 the culture using standard techniques identified below. 

IV. Purification of GPCR-B3 

Either naturally occurring or recombinant GPCR-B3 can be purified for 
use in functional assays. Optionally, recombinant GPCR-B3 is purified. Naturally 
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occurring GPCR-B3 is purified, e.g., from mammalian tissue such as tongue tissue, and 
any other source of a GPCR-B3 homolog. Recombinant GPCR-B3 is purified from any 
suitable expression system, e.g., bacterial and eukaryotic expression systems, e.g., CHO 
cells or insect cells. 

5 GPCR-B3 may be purified to substantial purity by standard techniques, 

including selective precipitation with such substances as ammonium sulfate; column 
chromatography, immunopurification methods, and others {see, e.g., Scopes, Protein 
Purification: Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et al, 
supra; and Sambrook et al, supra). 

10 A number of procedures can be employed when recombinant GPCR-B3 is 

being purified. For example, proteins having established molecular adhesion properties 
can be reversible fused to GPCR-B3. With the appropriate ligand, GPCR-B3 can be 
selectively adsorbed to a purification column and then freed from the column in a 
relatively pure form. The fused protein is then removed by enzymatic activity. Finally 

1 5 GPCR-B3 could be purified using immunoaffinity columns. 

A. Purification of GPCR-B3 from recombinant cells 
Recombinant proteins are expressed by transformed bacteria or eukaryotic 
cells such as CHO or insect cells in large amounts, typically after promoter induction; but 
20 expression can be constitutive. Promoter induction with IPTG is a one example of an 
inducible promoter system. Cells are grown according to standard procedures in the art. 
Fresh or frozen cells are used for isolation of protein. 

Proteins expressed in bacteria may form insoluble aggregates ("inclusion 
bodies"). Several protocols are suitable for purification of GPCR-B3 inclusion bodies. 
25 For example, purification of inclusion bodies typically involves the extraction, separation 
and/or purification of inclusion bodies by disruption of bacterial cells, e.g., by incubation 
in a buffer of 50 mM TRIS/HCL pH 7.5, 50 mM NaCl, 5 mM MgCl 2 , 1 mM DTT, 0.1 
mM ATP, and 1 mM PMSF. The cell suspension can be lysed using 2-3 passages 
through a French Press, homogenized using a Polytron (Brinkman Instruments) or 
30 sonicated on ice. Alternate methods of lysing bacteria are apparent to those of skill in the 
art {see, e.g., Sambrook et al, supra; Ausubel et al, supra). 

If necessary, the inclusion bodies are solubilized, and the lysed cell 
suspension is typically centrifiiged to remove unwanted insoluble matter. Proteins that 
formed the inclusion bodies may be renatured by dilution or dialysis with a compatible 
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buffer. Suitable solvents include, but are not limited to urea (from about 4 M to about 8 
M), formamide (at least about 80%, volume/volume basis), and guanidine hydrochloride 
(from about 4 M to about 8 M). Some solvents which are capable of solubilizing 
aggregate-forming proteins, for example SDS (sodium dodecyl sulfate), 70% formic acid, 
5 are inappropriate for use in this procedure due to the possibility of irreversible 

denaturation of the proteins, accompanied by a lack of immunogenicity and/or activity. 
Although guanidine hydrochloride and similar agents are denaturants, this denaturation is 
not irreversible and renaturation may occur upon removal (by dialysis, for example) or 
dilution of the denaturant, allowing re-formation of immunologically and/or biologically 

10 active protein. Other suitable buffers are known to those skilled in the art. GPCR-B3 is 
separated from other bacterial proteins by standard separation techniques, e.g., with Ni- 
NTA agarose resin. 

Alternatively, it is possible to purify GPCR-B3 from bacteria periplasm. 
After lysis of the bacteria, when GPCR-B3 is exported into the periplasm of the bacteria, 

1 5 the periplasmic fraction of the bacteria can be isolated by cold osmotic shock in addition 
to other methods known to skill in the art. To isolate recombinant proteins from the 
periplasm, the bacterial cells are centrifiiged to form a pellet. The pellet is resuspended in 
a buffer containing 20% sucrose. To lyse the cells, the bacteria are centrifiiged and the 
pellet is resuspended in ice-cold 5 mM MgSC>4 and kept in an ice bath for approximately 

20 1 0 minutes. The cell suspension is centrifiiged and the supernatant decanted and saved. 
The recombinant proteins present in the supernatant can be separated from the host 
proteins by standard separation techniques well known to those of skill in the art. 

B. Standard protein separation techniques for purifying GPCR-B3 

25 Solubility fractionation 

Often as an initial step, particularly if the protein mixture is complex, an 
initial salt fractionation can separate many of the unwanted host cell proteins (or proteins 
derived from the cell culture media) from the recombinant protein of interest. In one 
embodiment, the salt is ammonium sulfate. Ammonium sulfate precipitates proteins by 

30 effectively reducing the amount of water in the protein mixture. Proteins then precipitate 
on the basis of their solubility. The more hydrophobic a protein is, the more likely it is to 
precipitate at lower ammonium sulfate concentrations. A typical protocol includes adding 
saturated ammonium sulfate to a protein solution so that the resultant ammonium sulfate 
concentration is between 20-30%. This concentration will precipitate the most 
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hydrophobic of proteins. The precipitate is then discarded (unless the protein of interest 
is hydrophobic) and ammonium sulfate is added to the supernatant to a concentration 
known to precipitate the protein of interest. The precipitate is then solubilized in buffer 
and the excess salt removed if necessary, either through dialysis or diafiltration. Other 
methods that rely on solubility of proteins, such as cold ethanol precipitation, are well 
known to those of skill in the art and can be used to fractionate complex protein mixtures. 

Size differential filtration 
The molecular weight of GPCR-B3 can be used to isolated it from proteins 
of greater and lesser size using ultrafiltration through membranes of different pore size 
(for example, Amicon or Millipore membranes). As a first step, the protein mixture is 
ultrafiltered through a membrane with a pore size that has a lower molecular weight cut- 
off than the molecular weight of the protein of interest. The retentate of the ultrafiltration 
is then ultrafiltered against a membrane with a molecular cut off greater than the 
molecular weight of the protein of interest. The recombinant protein will pass through 
the membrane into the filtrate. The filtrate can then be chromatographed as described 
below. 

Column chromatography 
GPCR-B3 can also be separated from other proteins on the basis of its size, 
net surface charge, hydrophobicity, and affinity for ligands. In addition, antibodies raised 
against proteins can be conjugated to column matrices and the proteins immunopurified. 
All of these methods are well known in the art. It will be apparent to one of skill that 
chromatographic techniques can be performed at any scale and using equipment from 
many different manufacturers (e.g., Pharmacia Biotech). 

V. Immunological detection of GPCR-B3 

In addition to the detection of GPCR-B3 genes and gene expression using 
nucleic acid hybridization technology, one can also use immunoassays to detect GPCR- 
B3, e.g., to identify taste receptor cells and variants of GPCR-B3. Immunoassays can be 
used to qualitatively or quantitatively analyze GPCR-B3. A general overview of the 
applicable technology can be found in Harlow & Lane, Antibodies: A Laboratory Manual 
(1988). 
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A. Antibodies to GPCR-B3 

Methods of producing polyclonal and monoclonal antibodies that react 
specifically with GPCR-B3 are known to those of skill in the art (see, e.g., Coligan, 
Current Protocols in Immunology (1991); Harlow & Lane, supra; Goding, Monoclonal 
5 Antibodies: Principles and Practice (2d ed. 1986); and Kohler & Milstein, Nature 
256:495-497 (1975). Such techniques include antibody preparation by selection of 
antibodies from libraries of recombinant antibodies in phage or similar vectors, as well as 
preparation of polyclonal and monoclonal antibodies by immunizing rabbits or mice (see, 
e.g., Huse et aL, Science 246:1275-1281 (1989); Ward et al. 9 Nature 341 :544-546 
10 (1989)). 

A number of GPCR-B3 comprising immunogens may be used to produce 
antibodies specifically reactive with GPCR-B3. For example, recombinant GPCR-B3 or 
an antigenic fragment thereof, is isolated as described herein. Recombinant protein can 
be expressed in eukaryotic or prokaryotic cells as described above, and purified as 

15 generally described above. Recombinant protein is one embodiment of an immunogen 
for the production of monoclonal or polyclonal antibodies. Alternatively, a synthetic 
peptide derived from the sequences disclosed herein and conjugated to a carrier protein 
can be used an immunogen. Naturally occurring protein may also be used either in pure 
or impure form. The product is then injected into an animal capable of producing 

20 antibodies. Either monoclonal or polyclonal antibodies may be generated, for subsequent 
use in immunoassays to measure the protein. 

Methods of production of polyclonal antibodies are known to those of skill 
in the art. An inbred strain of mice (e.g., BALB/C mice) or rabbits is immunized with the 
protein using a standard adjuvant, such as Freund's adjuvant, and a standard 

25 immunization protocol. The animal's immune response to the immunogen preparation is 
monitored by taking test bleeds and determining the titer of reactivity to GPCR-B3. 
When appropriately high titers of antibody to the immunogen are obtained, blood is 
collected from the animal and antisera are prepared. Further fractionation of the antisera 
to enrich for antibodies reactive to the protein can be done if desired (see Harlow & Lane, 

30 supra). 

Monoclonal antibodies may be obtained by various techniques familiar to 
those skilled in the art. Briefly, spleen cells from an animal immunized with a desired 
antigen are immortalized, commonly by fusion with a myeloma cell (see Kohler & 
Milstein, Eur. J. Immunol. 6:51 1-519 (1976)). Alternative methods of immortalization 
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include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are 
screened for production of antibodies of the desired specificity and affinity for the 
antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced 
5 by various techniques, including injection into the peritoneal cavity of a vertebrate host. 
Alternatively, one may isolate DNA sequences which encode a monoclonal antibody or a 
binding fragment thereof by screening a DNA library from human B cells according to 
the general protocol outlined by Huse et al. 9 Science 246:1275-1281 (1989). 

Monoclonal antibodies and polyclonal sera are collected and titered 

1 0 against the immunogen protein in an immunoassay, for example, a solid phase 

immunoassay with the immunogen immobilized on a solid support. Typically, polyclonal 
antisera with a titer of 10 4 or greater are selected and tested for their cross reactivity 
against non-GPCR-B3 proteins or even other related proteins from other organisms, using 
a competitive binding immunoassay. Specific polyclonal antisera and monoclonal 

1 5 antibodies will usually bind with a K<i of at least about 0. 1 mM, more usually at least 
about 1 ^M, optionally at least about 0.1 \M or better, and optionally 0.01 or better. 

Once GPCR-B3 specific antibodies are available, GPCR-B3 can be 
detected by a variety of immunoassay methods. For a review of immunological and 
immunoassay procedures, see Basic and Clinical Immunology (Stites & Terr eds., 7th ed. 

20 1991). Moreover, the immunoassays of the present invention can be performed in any of 
several configurations, which are reviewed extensively in Enzyme Immunoassay (Maggio, 
ed., 1980); and Harlow & Lane, supra. 

B. Immunological binding assays 

25 GPCR-B3 can be detected and/or quantified using any of a number of well 

recognized immunological binding assays {see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also Methods 
in Cell Biology: Antibodies in Cell Biology, volume 37 (Asai, ed. 1993); Basic and 
Clinical Immunology (Stites & Terr, eds., 7th ed. 1991). Immunological binding assays 

30 (or immunoassays) typically use an antibody that specifically binds to a protein or antigen 
of choice (in this case the GPCR-B3 or antigenic subsequence thereof). The antibody 
(e.g., anti-GPCR-B3) may be produced by any of a number of means well known to those 
of skill in the art and as described above. 
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Immunoassays also often use a labeling agent to specifically bind to and 
label the complex formed by the antibody and antigen. The labeling agent may itself be 
one of the moieties comprising the antibody/antigen complex. Thus, the labeling agent 
may be a labeled GPCR-B3 polypeptide or a labeled anti-GPCR-B3 antibody. 
Alternatively, the labeling agent may be a third moiety, such a secondary antibody, that 
specifically binds to the antibody/GPCR-B3 complex (a secondary antibody is typically 
specific to antibodies of the species from which the first antibody is derived). Other 
proteins capable of specifically binding immunoglobulin constant regions, such as protein 
A or protein G may also be used as the label agent. These proteins exhibit a strong non- 
immunogenic reactivity with immunoglobulin constant regions from a variety of species 
{see, e.g., Kronval et al, 1 Immunol 111:1401-1406(1973); Akerstrom e/ a/., /. 
Immunol 135:2589-2542 (1985)). The labeling agent can be modified with a detectable 
moiety, such as biotin, to which another molecule can specifically bind, such as 
streptavidin. A variety of detectable moieties are well known to those skilled in the art. 

Throughout the assays, incubation and/or washing steps may be required 
after each combination of reagents. Incubation steps can vary from about 5 seconds to 
several hours, optionally from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, antigen, volume of solution, 
concentrations, and the like. Usually, the assays will be carried out at ambient 
temperature, although they can be conducted over a range of temperatures, such as 10°C 
to40°C. 

Non-competitive assay formats 
Immunoassays for detecting GPCR-B3 in samples may be either 
competitive or noncompetitive. Noncompetitive immunoassays are assays in which the 
amount of antigen is directly measured. In one embodiment "sandwich" assay, for 
example, the anti-GPCR-B3 antibodies can be bound directly to a solid substrate on 
which they are immobilized. These immobilized antibodies then capture GPCR-B3 
present in the test sample. GPCR-B3 is thus immobilized is then bound by a labeling 
agent, such as a second GPCR-B3 antibody bearing a label. Alternatively, the second 
antibody may lack a label, but it may, in turn, be bound by a labeled third antibody 
specific to antibodies of the species from which the second antibody is derived. The 
second or third antibody is typically modified with a detectable moiety, such as biotin, to 
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which another molecule specifically binds, e.g., streptavidin, to provide a detectable 
moiety. 

Competitive assay formats 
5 In competitive assays, the amount of GPCR-B3 present in the sample is 

measured indirectly by measuring the amount of a known, added (exogenous) GPCR-B3 
displaced (competed away) from an anti-GPCR-B3 antibody by the unknown GPCR-B3 
present in a sample. In one competitive assay, a known amount of GPCR-B3 is added to 
a sample and the sample is then contacted with an antibody that specifically binds to 

10 GPCR-B3. The amount of exogenous GPCR-B3 bound to the antibody is inversely 

proportional to the concentration of GPCR-B3 present in the sample. In one embodiment, 
the antibody is immobilized on a solid substrate. The amount of GPCR-B3 bound to the 
antibody may be determined either by measuring the amount of GPCR-B3 present in a 
GPCR-B3/antibody complex, or alternatively by measuring the amount of remaining 

15 uncomplexed protein. The amount of GPCR-B3 may be detected by providing a labeled 
GPCR-B3 molecule. 

A hapten inhibition assay is another competitive assay. In this assay the 
known GPCR-B3, is immobilized on a solid substrate. A known amount of anti-GPCR- 
B3 antibody is added to the sample, and the sample is then contacted with the 

20 immobilized GPCR-B3. The amount of anti-GPCR-B3 antibody bound to the known 
immobilized GPCR-B3 is inversely proportional to the amount of GPCR-B3 present in 
the sample. Again, the amount of immobilized antibody may be detected by detecting 
either the immobilized fraction of antibody or the fraction of the antibody that remains in 
solution. Detection may be direct where the antibody is labeled or indirect by the 

25 subsequent addition of a labeled moiety that specifically binds to the antibody as 
described above. 

Cross-reactivity determinations 

Immunoassays in the competitive binding format can also be used for 
30 crossreactivity determinations. For example, a protein at least partially encoded by SEQ 
ID NOS:l-3 can be immobilized to a solid support. Proteins (e.g., GPCR-B3 proteins 
and homologs) are added to the assay that compete for binding of the antisera to the 
immobilized antigen. The ability of the added proteins to compete for binding of the 
antisera to the immobilized protein is compared to the ability of GPCR-B3 encoded by 
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SEQ ID NO: 1-3 to compete with itself. The percent crossreactivity for the above proteins 
is calculated, using standard calculations. Those antisera with less than 10% 
crossreactivity with each of the added proteins listed above are selected and pooled. The 
cross-reacting antibodies are optionally removed from the pooled antisera by 
5 immunoabsorption with the added considered proteins, e.g., distantly related homologs. 

The immunoabsprbed and pooled antisera are then used in a competitive 
binding immunoassay as described above to compare a second protein, thought to be 
perhaps an allele or polymorphic variant of GPCR-B3, to the immunogen protein (i.e., 
GPCR-B3 of SEQ ID NOS:l-3). In order to make this comparison, the two proteins are 

10 each assayed at a wide range of concentrations and the amount of each protein required to 
inhibit 50% of the binding of the antisera to the immobilized protein is determined. If the 
amount of the second protein required to inhibit 50% of binding is less than 10 times the 
amount of the protein encoded by SEQ ID NOS:l-3 that is required to inhibit 50% of 
binding, then the second protein is said to specifically bind to the polyclonal antibodies 

1 5 generated to a GPCR-B3 immunogen. 

Other assay formats 
Western blot (immunoblot) analysis is used to detect and quantify the 
presence of GPCR-B3 in the sample. The technique generally comprises separating 
20 sample proteins by gel electrophoresis on the basis of molecular weight, transferring the 
separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon filter, 
or derivatized nylon filter), and incubating the sample with the antibodies that specifically 
bind GPCR-B3. The anti-GPCR-B3 antibodies specifically bind to the GPCR-B3 on the 
solid support. These antibodies may be directly labeled or alternatively may be 
25 subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) 
that specifically bind to the anti-GPCR-B3 antibodies. 

Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals are then detected according to standard 
30 techniques {see Monroe et al., Amer. Clin. Prod. Rev. 5:34-41 (1986)). 

Reduction of non-specific binding 
One of skill in the art will appreciate that it is often desirable to minimize 
non-specific binding in immunoassays. Particularly, where the assay involves an antigen 
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or antibody immobilized on a solid substrate it is desirable to minimize the amount of 
non-specific binding to the substrate. Means of reducing such non-specific binding are 
well known to those of skill in the art. Typically, this technique involves coating the 
substrate with a proteinaceous composition. In particular, protein compositions such as 
5 bovine serum albumin (BSA), nonfat powdered milk, and gelatin are widely used. 

Labels 

The particular label or detectable group used in the assay is not a critical 
aspect of the invention, as long as it does not significantly interfere with the specific 

10 binding of the antibody used in the assay. The detectable group can be any material 

having a detectable physical or chemical property. Such detectable labels have been well- 
developed in the field of immunoassays and, in general, most any label useful in such 
methods can be applied to the present invention. Thus, a label is any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 

15 optical or chemical means. Useful labels in the present invention include magnetic beads 
(e.g., DYNABEADS™), fluorescent dyes (e.g., fluorescein isothiocyanate, Texas red, 
rhodamine, and the like), radiolabels (e.g., 3 H, 125 1, 35 S, ,4 C, or 32 P), enzymes (e.g., horse 
radish peroxidase, alkaline phosphatase and others commonly used in an ELIS A), and 
colorimetric labels such as colloidal gold or colored glass or plastic beads (e.g., 

20 polystyrene, polypropylene, latex, etc.). 

The label may be coupled directly or indirectly to the desired component 
of the assay according to methods well known in the art. As indicated above, a wide 
variety of labels may be used, with the choice of label depending on sensitivity required, 
ease of conjugation with the compound, stability requirements, available instrumentation, 

25 and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds 
to another molecules (e.g., streptavidin) molecule, which is either inherently detectable or 
covalently bound to a signal system, such as a detectable enzyme, a fluorescent 

30 compound, or a chemiluminescent compound. The ligands and their targets can be used 
in any suitable combination with antibodies that recognize GPCR-B3, or secondary 
antibodies that recognize anti-GPCR-B3 . 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of interest as 
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labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, 
or oxidotases, particularly peroxidases. Fluorescent compounds include fluorescein and 
its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc; 
Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., 
5 luminol. For a review of various labeling or signal producing systems that may be used, 
see U.S. Patent No. 4,391,904. 

Means of detecting labels are well known to those of skill in the art. Thus, 
for example, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic film as in autoradiography. Where the label is a 

1 0 fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
wavelength of light and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic film, by the use of electronic detectors such 
as charge coupled devices (CCDs) or photomultipliers and the like. Similarly, enzymatic 
labels may be detected by providing the appropriate substrates for the enzyme and 

15 detecting the resulting reaction product. Finally simple colorimetric labels may be 
detected simply by observing the color associated with the label. Thus, in various 
dipstick assays, conjugated gold often appears pink, while various conjugated beads 
appear the color of the bead. 

Some assay formats do not require the use of labeled components. For 

20 instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labeled and the presence of 
the target antibody is detected by simple visual inspection. 

25 VI. Assays for modulators of GPCR-B3 

A. Assays for GPCR-B3 activity 

GPCR-B3 and its alleles and polymorphic variants are G-protein coupled 
receptors that participate in taste transduction. The activity of GPCR-B3 polypeptides 
can be assessed using a variety of in vitro and in vivo assays that determine functional, 
30 physical and chemical effects, e.g., measuring ligand binding (e.g., by radioactive ligand 
binding), second messengers (e.g., cAMP, cGMP, IP 3 , DAG, or Ca 2+ ), ion flux, 
phosphorylation levels, transcription levels, neurotransmitter levels, and the like. 
Furthermore, such assays can be used to test for inhibitors and activators of GPCR-B3. 
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Modulators can also be genetically altered versions of GPCR-B3. Such modulators of 
taste transduction activity are useful for customizing taste. 

The GPCR-B3 of the assay will be selected from a polypeptide having a 
sequence of SEQ ID NOS:l-3 or conservatively modified variant thereof. Alternatively, 
5 the GPCR-B3 of the assay will be derived from a eukaryote and include an amino acid 
subsequence having amino acid sequence identity SEQ ID NOS:l-3. Generally, the 
amino acid sequence identity will be at least 70%, optionally at least 85%, most 
optionally at least 90-95%. Optionally, the polypeptide of the assays will comprise a 
domain of GPCR-B3, such as an extracellular domain, transmembrane domain, 

10 cytoplasmic domain, ligand binding domain, subunit association domain, active site, and 
the like. Either GPCR-B3 or a domain thereof can be covalently linked to a heterologous 
protein to create a chimeric protein used in the assays described herein. 

Modulators of GPCR-B3 activity are tested using GPCR-B3 polypeptides 
as described above, either recombinant or naturally occurring. The protein can be 

15 isolated, expressed in a cell, expressed in a membrane derived from a cell, expressed in 
tissue or in an animal, either recombinant or naturally occurring. For example, tongue 
slices, dissociated cells from a tongue, transformed cells, or membranes can b used.. 
Modulation is tested using one of the in vitro or in vivo assays described herein. Taste 
transduction can also be examined in vitro with soluble or solid state reactions, using a 

20 chimeric molecule such as an extracellular domain of a receptor covalently linked to a 
heterologous signal transduction domain, or a heterologous extracellular domain 
covalently linked to the transmembrane and or cytoplasmic domain of a receptor. 
Furthermore, ligand-binding domains of the protein of interest can be used in vitro in 
soluble or solid state reactions to assay for ligand binding. 

25 Ligand binding to GPCR-B3, a domain, or chimeric protein can be tested 

in solution, in a bilayer membrane, attached to a solid phase, in a lipid monolayer, or in 
vesicles. Binding of a modulator can be tested using, e.g., changes in spectroscopic 
characteristics (e.g., fluorescence, absorbance, refractive index) hydrodynamic (e.g., 
shape), chromatographic, or solubility properties. 

30 Receptor-G-protein interactions can also be examined. For example, 

binding of the G-protein to the receptor or its release from the receptor can be examined. 
For example, in the absence of GTP, an activator will lead to the formation of a tight 
complex of a G protein (all three subunits) with the receptor. This complex can be 
detected in a variety of ways, as noted above. Such an assay can be modified to search 
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for inhibitors. Add an activator to the receptor and G protein in the absence of GTP, form 
a tight complex, and then screen for inhibitors by looking at dissociation of the receptor- 
G protein complex. In the presence of GTP, release of the alpha subunit of the G protein 
from the other two G protein subunits serves as a criterion of activation. 
5 An activated or inhibited G-protein will in turn alter the properties of 

target enzymes, channels, and other effector proteins. The classic examples are the 
activation of cGMP phosphodiesterase by transducin in the visual system, adenylate 
cyclase by the stimulatory G-protein, phospholipase C by Gq and other cognate G 
proteins, and modulation of diverse channels by Gi and other G proteins. Downstream 

10 consequences can also be examined such as generation of diacyl glycerol and IP3 by 
phospholipase C, and in turn, for calcium mobilization by IP3. 

Activated GPCR receptors become substrates for kinases that 
phosphorylate the C-terminal tail of the receptor (and possibly other sites as well). Thus, 
activators will promote the transfer of 32 P from gamma-labeled GTP to the receptor, 

1 5 which can be assayed with a scintillation counter. The phosphorylation of the C-terminal 
tail will promote the binding of arrestin-like proteins and will interfere with the binding of 
G-proteins. The kinase/arrestin pathway plays a key role in the desensitization of many 
GPCR receptors. For example, compounds that modulate the duration a taste receptor 
stays active would be useful as a means of prolonging a desired taste or cutting off an 

20 unpleasant one. For a general review of GPCR signal transduction and methods of 

assaying signal transduction, see, e.g., Methods in Enzymology, vols. 237 and 238 (1994) 
and volume 96 (1983); Bourne et al, Nature 10:349:1 17-27 (1991); Bourne et al., Nature 
348:125-32 (1990); Pitcher et al, Annu. Rev. Biochem. 67:653-92 (1998). 

Samples or assays that are treated with a potential GPCR-B3 inhibitor or 

25 activator are compared to control samples without the test compound, to examine the 
extent of modulation. Control samples (untreated with activators or inhibitors) are 
assigned a relative GPCR-B3 activity value of 100. Inhibition of GPCR-B3 is achieved 
when the GPCR-B3 activity value relative to the control is about 90%, optionally 50%, 
optionally 25-0%. Activation of GPCR-B3 is achieved when the GPCR-B3 activity value 

30 relative to the control is 110%, optionally 150%, 200-500%, or 1000-2000%. 

Changes in ion flux may be assessed by determining changes in 
polarization (i.e., electrical potential) of the cell or membrane expressing GPCR-B3. One 
means to determine changes in cellular polarization is by measuring changes in current 
(thereby measuring changes in polarization) with voltage-clamp and patch-clamp 
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techniques, e.g., the "cell-attached" mode, the "inside-out" mode, and the "whole cell" 
mode (see, e.g., Ackerman etal. t New Engl. J. Med. 336:1575-1595 (1997)). Whole cell 
currents are conveniently determined using the standard methodology (see, e.g., Hamil et 
al, PFlugers. Archiv. 391:85 (1981). Other known assays include: radiolabeled ion flux 
5 assays and fluorescence assays using voltage-sensitive dyes (see, e.g., Vestergarrd- 

Bogind et al, J. Membrane Biol. 88:67-75 (1988); Gonzales & Tsien, Chem. Biol. 4:269- 
277 (1997); Daniel et al, J. Pharmacol. Meth. 25:185-193 (1991); Holevinsky et al, J. 
Membrane Biology 137:59-70 (1994)). Generally, the compounds to be tested are present 
in the range from 1 pM to 100 mM. 

1 0 The effects of the test compounds upon the function of the polypeptides 

can be measured by examining any of the parameters described above. Any suitable 
physiological change that affects GPCR activity can be used to assess the influence of a 
test compound on the polypeptides of this invention. When the functional consequences 
are determined using intact cells or animals, one can also measure a variety of effects 

15 such as transmitter release, hormone release, transcriptional changes to both known and 
uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 
cell growth or pH changes, and changes in intracellular second messengers such as Ca 2+ , 
IP3orcAMP. 

Assays for G-protein coupled receptors include cells that are loaded with 
20 ion or voltage sensitive dyes to report receptor activity. Assays for determining activity 
of such receptors can also use known agonists and antagonists for other G-protein 
coupled receptors as negative or positive controls to assess activity of tested compounds. 
In assays for identifying modulatory compounds (e.g., agonists, antagonists), changes in 
the level of ions in the cytoplasm or membrane voltage will be monitored using an ion - 
25 sensitive or membrane voltage fluorescent indicator, respectively. Among the ion- 
sensitive indicators and voltage probes that may be employed are those disclosed in the 
Molecular Probes 1997 Catalog. For G-protein coupled receptors, promiscuous G- 
proteins such as Gal5 and Gal6 can be used in the assay of choice (Wilkie et al, Proc. 
NaflAcad. Sci. USA 88:10049-10053 (1991)). Such promiscuous G-proteins allow 
30 coupling of a wide range of receptors. 

Receptor activation typically initiates subsequent intracellular events, e.g., 
increases in second messengers such as IP3, which releases intracellular stores of calcium 
ions. Activation of some G-protein coupled receptors stimulates the formation of inositol 
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triphosphate (IP3) through phospholipase C-mediated hydrolysis of phosphatidylinositol 
(Berridge & Irvine, Nature 312:315-21 (1984)). IP3 in turn stimulates the release of 
intracellular calcium ion stores. Thus, a change in cytoplasmic calcium ion levels, or a 
change in second messenger levels such as IP3 can be used to assess G-protein coupled 
5 receptor function. Cells expressing such G-protein coupled receptors may exhibit 
increased cytoplasmic calcium levels as a result of contribution from both intracellular 
stores and via activation of ion channels, in which case it may be desirable although not 
necessary to conduct such assays in calcium-free buffer, optionally supplemented with a 
chelating agent such as EGTA, to distinguish fluorescence response resulting from 

10 calcium release from internal stores. 

Other assays can involve determining the activity of receptors which, 
when activated, result in a change in the level of intracellular cyclic nucleotides, e.g., 
cAMP or cGMP, by activating or inhibiting enzymes such as adenylate cyclase. There 
are cyclic nucleotide-gated ion channels, e.g., rod photoreceptor cell channels and 

15 olfactory neuron channels that are permeable to cations upon activation by binding of 
cAMP or cGMP {see, e.g., Altenhofen et al, Proc. Natl Acad. ScL U.S.A. 88:9868-9872 
(1991) and Dhallan et al t Nature 347:184-187 (1990)). In cases where activation of the 
receptor results in a decrease in cyclic nucleotide levels, it may be preferable to expose 
the cells to agents that increase intracellular cyclic nucleotide levels, e.g., forskolin, prior 

20 to adding a receptor-activating compound to the cells in the assay. Cells for this type of 
assay can be made by co-transfection of a host cell with DNA encoding a cyclic 
nucleotide-crated ion channel, GPCR phosphatase and DNA encoding a receptor (e.g., 
certain glutamate receptors, muscarinic acetylcholine receptors, dopamine receptors, 
serotonin receptors, and the like), which, when activated, causes a change in cyclic 

25 nucleotide levels in the cytoplasm. 

In one embodiment, GPCR-B3 activity is measured by expressing GPCR- 
B3 in a heterologous cell with a promiscuous G-protein that links the receptor to a 
phospholipase C signal transduction pathway {see Offermanns & Simon, J. Biol. Chem. 
270:15175-15180 (1995)). Optionally the cell line is HEK-293 (which does not naturally 

30 express GPCR-B3) and the promiscuous G-protein is Gal 5 (Offermanns & Simon, 

supra). Modulation of taste transduction is assayed by measuring changes in intracellular 
Ca 2+ levels, which change in response to modulation of the GPCR-B3 signal transduction 
pathway via administration of a molecule that associates with GPCR-B3. Changes in 
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Ca 2+ levels are optionally measured using fluorescent Ca 2+ indicator dyes and 
fluorometric imaging. 

In one embodiment, the changes in intracellular cAMP or cGMP can be 
measured using immunoassays. The method described in Offermanns & Simon, J. Biol 
Chem. 270:15175-15180 (1995) may be used to determine the level of cAMP. Also, the 
method described in Felley-Bosco et al. 9 Am. J. Resp. Cell andMol Biol 1 1:159-164 
(1994) may be used to determine the level of cGMP. Further, an assay kit for measuring 
cAMP and/or cGMP is described in U.S. Patent 4,115,538, herein incorporated by 
reference. 

In another embodiment, phosphatidyl inositol (PI) hydrolysis can be 
analyzed according to U.S. Patent 5,436,128, herein incorporated by reference. Briefly, 
the assay involves labeling of cells with 3 H-myoinositol for 48 or more hrs. The labeled 
cells are treated with a test compound for one hour. The treated cells are lysed and 
extracted in chloroform-methanol-water after which the inositol phosphates were 
separated by ion exchange chromatography and quantified by scintillation counting. Fold 
stimulation is determined by calculating the ratio of cpm in the presence of agonist to 
cpm in the presence of buffer control. Likewise, fold inhibition is determined by 
calculating the ratio of cpm in the presence of antagonist to cpm in the presence of buffer 
control (which may or may not contain an agonist). 

In another embodiment, transcription levels can be measured to assess the 
effects of a test compound on signal transduction. A host cell containing the protein of 
interest is contacted with a test compound for a sufficient time to effect any interactions, 
and then the level of gene expression is measured. The amount of time to effect such 
interactions may be empirically determined, such as by running a time course and 
measuring the level of transcription as a function of time. The amount of transcription 
may be measured by using any method known to those of skill in the art to be suitable. 
For example, mRNA expression of the protein of interest may be detected using northern 
blots or their polypeptide products may be identified using immunoassays. Alternatively, 
transcription based assays using reporter gene may be used as described in U.S. Patent 
5,436,128, herein incorporated by reference. The reporter genes can be, e.g., 
chloramphenicol acetyltransferase, firefly luciferase, bacterial luciferase, P-galactosidase 
and alkaline phosphatase. Furthermore, the protein of interest can be used as an indirect 
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reporter via attachment to a second reporter such as green fluorescent protein (see, e.g., 
Mistili & Spector, Nature Biotechnology 15:961-964 (1997)). 

The amount of transcription is then compared to the amount of 
transcription in either the same cell in the absence of the test compound, or it may be 
5 compared with the amount of transcription in a substantially identical cell that lacks the 
protein of interest. A substantially identical cell may be derived from the same cells from 
which the recombinant cell was prepared but which had not been modified by 
introduction of heterologous DNA. Any difference in the amount of transcription 
indicates that the test compound has in some manner altered the activity of the protein of 
10 interest. 



B. Modulators 

The compounds tested as modulators of GPCR-B3 can be any small 
chemical compound, or a biological entity, such as a protein, sugar, nucleic acid or lipid. 

15 Alternatively, modulators can be genetically altered versions of GPCR-B3. Typically, 
test compounds will be small chemical molecules and peptides. Essentially any chemical 
compound can be used as a potential modulator or ligand in the assays of the invention, 
although most often compounds can be dissolved in aqueous or organic (especially 
DMSO-based) solutions are used. The assays are designed to screen large chemical 

20 libraries by automating the assay steps and providing compounds from any convenient 
source to assays, which are typically run in parallel (e.g., in microtiter formats on 
micro titer plates in robotic assays). It will be appreciated that there are many suppliers of 
chemical compounds, including Sigma (St. Louis, MO), Aldrich (St. Louis, MO), Sigma- 
Aldrich (St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs Switzerland) and 

25 the like. 

In one embodiment, high throughput screening methods involve providing 
a combinatorial chemical or peptide library containing a large number of potential 
therapeutic compounds (potential modulator or ligand compounds). Such "combinatorial 
chemical libraries" or "ligand libraries" are then screened in one or more assays, as 
30 described herein, to identify those library members (particular chemical species or 

subclasses) that display a desired characteristic activity. The compounds thus identified 
can serve as conventional "lead compounds" or can themselves be used as potential or 
actual therapeutics. 
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A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis, by combining 
a number of chemical "building blocks" such as reagents. For example, a linear 
combinatorial chemical library such as a polypeptide library is formed by combining a set 
5 of chemical building blocks (amino acids) in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks. 

Preparation and screening of combinatorial chemical libraries is well 

10 known to those of skill in the art. Such combinatorial chemical libraries include, but are 
not limited to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. 
Res. 37:487-493 (1991) and Houghton et al. y Nature 354:84-88 (1991)). Other 
chemistries for generating chemical diversity libraries can also be used. Such chemistries 
include, but are not limited to: peptoids (e.g., PCT Publication No. WO 91/19735), 

15 encoded peptides (e.g., PCT Publication WO 93/20242), random bio-oligomers (e.g., 
PCT Publication No. WO 92/00091), benzodiazepines (e.g., U.S. Pat. No. 5,288,514), 
diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs et aL % Proc. Nat. 
Acad. Sci. USA 90:6909-6913 (1993)), vinylogous polypeptides (Hagihara etal, J. Amer. 
Chem. Soc. 1 14:6568 (1992)), nonpeptidal peptidomimetics with glucose scaffolding 

20 (Hirschmaim et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic 
syntheses of small compound libraries (Chen et aL, J. Amer. Chem. Soc. 1 16:2661 
(1994)), oligocarbamates (Cho et ai 9 Science 261:1303 (1993)), and/or peptidyl 
phosphonates (Campbell et al, J. Org. Chem. 59:658 (1994)), nucleic acid libraries (see 
Ausubel, Berger and Sambrook, all supra), peptide nucleic acid libraries (see, e.g., U.S. 

25 Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al. 9 Nature Biotechnology, 
14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang et 
ai, Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&EN, Jan 18, page 33 (1993); isoprenoids, 
U.S. Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent 5,549,974; 

30 pyrrolidines, U.S. Patents 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent 
5,506,337; benzodiazepines, 5,288,514, and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, 
Symphony, Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, 
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Millipore, Bedford, MA). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Tripos, Inc., St. Louis, 
MO, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.). 

5 C. Solid State and soluble high throughput assays 

In one embodiment the invention provide soluble assays using molecules 
such as a domain such as ligand binding domain, an extracellular domain, a 
transmembrane domain (e.g., one comprising seven transmembrane regions and cytosolic 
loops), the transmembrane domain and a cytoplasmic domain, an active site, a subunit 

10 association region, etc.; a domain that is covalently linked to a heterologous protein to 
create a chimeric molecule; GPCR-B3; or a cell or tissue expressing GPCR-B3, either 
naturally occurring or recombinant. In another embodiment, the invention provides solid 
phase based in vitro assays in a high throughput format, where the domain, chimeric 
molecule, GPCR-B3, or cell or tissue expressing GPCR-B3 is attached to a solid phase 

15 substrate. 

In the high throughput assays of the invention, it is possible to screen up to 
several thousand different modulators or ligands in a single day. In particular, each well 
of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or, if concentration or incubation time effects are to be observed, every 5-10 

20 wells can test a single modulator. Thus, a single standard microtiter plate can assay about 
100 (e.g., 96) modulators. If 1536 well plates are used, then a single plate can easily 
assay from about 100- about 1500 different compounds. It is possible to assay several 
different plates per day; assay screens for up to about 6,000-20,000 different compounds 
is possible using the integrated systems of the invention. More recently, microfluidic 

25 approaches to reagent manipulation have been developed, e.g., by Caliper Technologies 
(Palo Alto, CA). 

The molecule of interest can be bound to the solid state component, 
directly or indirectly, via covalent or non covalent linkage e.g., via a tag. The tag can be 
any of a variety of components. In general, a molecule which binds the tag (a tag binder) 
30 is fixed to a solid support, and the tagged molecule of interest (e.g., the taste transduction 
molecule of interest) is attached to the solid support by interaction of the tag and the tag 
binder. 

A number of tags and tag binders can be used, based upon known 
molecular interactions well described in the literature. For example, where a tag has a 
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natural binder, for example, biotin, protein A, or protein G, it can be used in conjunction 
with appropriate tag binders (avidin, streptavidin, neutravidin, the Fc region of an 
immunoglobulin, etc.) Antibodies to molecules with natural binders such as biotin are 
also widely available and appropriate tag binders; see, SIGMA Immunochemicals 1998 
5 catalogue SIGMA, St. Louis MO). 

Similarly, any haptenic or antigenic compound can be used in combination 
with an appropriate antibody to form a tag/tag binder pair. Thousands of specific 
antibodies are commercially available and many additional antibodies are described in the 
literature. For example, in one common configuration, the tag is a first antibody and the 

10 tag binder is a second antibody which recognizes the first antibody. In addition to 

antibody-antigen interactions, receptor-ligand interactions are also appropriate as tag and 
tag-binder pairs. For example, agonists and antagonists of cell membrane receptors (e.g., 
cell receptor-ligand interactions such as transferrin, c-kit, viral receptor ligands, cytokine 
receptors, chemokine receptors, interleukin receptors, immunoglobulin receptors and 

15 antibodies, the cadherein family, the integrin family, the selectin family, and the like; see, 
e.g., Pigott & Power, The Adhesion Molecule Facts Book I (1993). Similarly, toxins and 
venoms, viral epitopes, hormones (e.g., opiates, steroids, etc.), intracellular receptors (e.g. 
which mediate the effects of various small ligands, including steroids, thyroid hormone, 
retinoids and vitamin D; peptides), drugs, lectins, sugars, nucleic acids (both linear and 

20 cyclic polymer configurations), oligosaccharides, proteins, phospholipids and antibodies 
can all interact with various cell receptors. 

Synthetic polymers, such as polyurethanes, polyesters, polycarbonates, 
polyureas, polyamides, polyethyleneimines, polyarylene sulfides, polysiloxanes, 
polyimides, and polyacetates can also form an appropriate tag or tag binder. Many other 

25 tag/tag binder pairs are also useful in assay systems described herein, as would be 
apparent to one of skill upon review of this disclosure. 

Common linkers such as peptides, polyethers, and the like can also serve 
as tags, and include polypeptide sequences, such as poly gly sequences of between about 
5 and 200 amino acids. Such flexible linkers are known to persons of skill in the art. For 

30 example, poly(ethelyne glycol) linkers are available from Shearwater Polymers, Inc. 

Huntsville, Alabama. These linkers optionally have amide linkages, sulfhydryl linkages, 
or hetero functional linkages. 

Tag binders are fixed to solid substrates using any of a variety of methods 
currently available. Solid substrates are commonly derivatized or functionalized by 
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exposing all or a portion of the substrate to a chemical reagent which fixes a chemical 
group to the surface which is reactive with a portion of the tag binder. For example, 
groups which are suitable for attachment to a longer chain portion would include amines, 
hydroxyl, thiol, and carboxyl groups. Aminoalkylsilanes and hydroxyalkylsilanes can be 
5 used to functionalize a variety of surfaces, such as glass surfaces. The construction of 
such solid phase biopolymer arrays is well described in the literature. See, e.g., 
Merrifield, 1 Am. Chem. Soc. 85:2149-2154 (1963) (describing solid phase synthesis of, 
e.g., peptides); Geysen et al, J. Immun. Meth. 102:259-274 (1987) (describing synthesis 
of solid phase components on pins); Frank & Doring, Tetrahedron 44:60316040 (1988) 

10 (describing synthesis of various peptide sequences on cellulose disks); Fodor et al, 

Science, 251:767-777 (1991); Sheldon et al, Clinical Chemistry 39(4):718-719 (1993); 
and Kozal et al., Nature Medicine 2(7):753759 (1996) (all describing arrays of 
biopolymers fixed to solid substrates). Non-chemical approaches for fixing tag binders to 
substrates include other common methods, such as heat, cross-linking by UV radiation, 

15 and the like. 

D. Computer-based assays 

Yet another assay for compounds that modulate GPCR-B3 activity 
involves computer assisted drug design, in which a computer system is used to generate a 

20 three-dimensional structure of GPCR-B3 based on the structural information encoded by 
the amino acid sequence. The input amino acid sequence interacts directly and actively 
with a preestablished algorithm in a computer program to yield secondary, tertiary, and 
quaternary structural models of the protein. The models of the protein structure are then 
examined to identify regions of the structure that have the ability to bind, e.g., ligands. 

25 These regions are then used to identify ligands that bind to the protein. 

The three-dimensional structural model of the protein is generated by 
entering protein amino acid sequences of at least 10 amino acid residues or corresponding 
nucleic acid sequences encoding a GPCR-B3 polypeptide into the computer system. The 
amino acid sequence of the polypeptide of the nucleic acid encoding the polypeptide is 

30 selected from the group consisting of SEQ ID NOS: 1 -3 or SEQ ID NOS :4-6 and 
conservatively modified versions thereof. The amino acid sequence represents the 
primary sequence or subsequence of the protein, which encodes the structural information 
of the protein. At least 10 residues of the amino acid sequence (or a nucleotide sequence 
encoding 10 amino acids) are entered into the computer system from computer 
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keyboards, computer readable substrates that include, but are not limited to, electronic 
storage media (e.g., magnetic diskettes, tapes, cartridges, and chips), optical media (e.g., 
CD ROM), information distributed by internet sites, and by RAM. The three-dimensional 
structural model of the protein is then generated by the interaction of the amino acid 
5 sequence and the computer system, using software known to those of skill in the art. 

The amino acid sequence represents a primary structure that encodes the 
information necessary to form the secondary, tertiary and quaternary structure of the 
protein of interest. The software looks at certain parameters encoded by the primary 
sequence to generate the structural model. These parameters are referred to as "energy 

1 0 terms," and primarily include electrostatic potentials, hydrophobic potentials, solvent 
accessible surfaces, and hydrogen bonding. Secondary energy terms include van der 
Waals potentials. Biological molecules form the structures that minimize the energy 
terms in a cumulative fashion. The computer program is therefore using these terms 
encoded by the primary structure or amino acid sequence to create the secondary 

15 structural model. 

The tertiary structure of the protein encoded by the secondary structure is 
then formed on the basis of the energy terms of the secondary structure. The user at this 
point can enter additional variables such as whether the protein is membrane bound or 
soluble, its location in the body, and its cellular location, e.g., cytoplasmic, surface, or 

20 nuclear. These variables along with the energy terms of the secondary structure are used 
to form the model of the tertiary structure. In modeling the tertiary structure, the 
computer program matches hydrophobic faces of secondary structure with like, and 
hydrophilic faces of secondary structure with like. 

Once the structure has been generated, potential ligand binding regions are 

25 identified by the computer system. Three-dimensional structures for potential ligands are 
generated by entering amino acid or nucleotide sequences or chemical formulas of 
compounds, as described above. The three-dimensional structure of the potential ligand 
is then compared to that of the GPCR-B3 protein to identify ligands that bind to GPCR- 
B3. Binding affinity between the protein and ligands is determined using energy terms to 

30 determine which ligands have an enhanced probability of binding to the protein. 

Computer systems are also used to screen for mutations, polymorphic 
variants, alleles and interspecies homologs of GPCR-B3 genes. Such mutations can be 
associated with disease states or genetic traits. As described above, GeneChip™ and 
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related technology can also be used to screen for mutations, polymorphic variants, alleles 
and interspecies homologs. Once the variants are identified, diagnostic assays can be 
used to identify patients having such mutated genes. Identification of the mutated GPCR- 
B3 genes involves receiving input of a first nucleic acid or amino acid sequence encoding 
5 GPCR-B3, selected from the group consisting of SEQ ID NOS: 1 -3, or SEQ ID NOS :4-6 
and conservatively modified versions thereof. The sequence is entered into the computer 
system as described above. The first nucleic acid or amino acid sequence is then 
compared to a second nucleic acid or amino acid sequence that has substantial identity to 
the first sequence. The second sequence is entered into the computer system in the 
10 manner described above. Once the first and second sequences are compared, nucleotide 
or amino acid differences between the sequences are identified. Such sequences can 
represent allelic differences in GPCR-B3 genes, and mutations associated with disease 
states and genetic traits. 

15 VIII. Kits 

GPCR-B3 and its homologs are a useful tool for identifying taste receptor 
cells, for forensics and paternity determinations, and for examining taste transduction. 
GPCR-B3 specific reagents that specifically hybridize to GPCR-B3 nucleic acid, such as 
GPCR-B3 probes and primers, and GPCR-B3 specific reagents that specifically bind to 

20 the GPCR-B3 protein, e.g., GPCR-B3 antibodies are used to examine taste cell expression 
and taste transduction regulation. 

Nucleic acid assays for the presence of GPCR-B3 DNA and RNA in a 
sample include numerous techniques are known to those skilled in the art, such as 
Southern analysis, northern analysis, dot blots, RNase protection, SI analysis, 

25 amplification techniques such as PCR and LCR, and in situ hybridization. In in situ 
hybridization, for example, the target nucleic acid is liberated from its cellular 
surroundings in such as to be available for hybridization within the cell while preserving 
the cellular morphology for subsequent interpretation and analysis. The following articles 
provide an overview of the art of in situ hybridization: Singer et ai, Biotechniques 4:230- 

30 250 (1986); Haase et al, Methods in Virology, vol. VII, pp. 189-226 (1984); and Nucleic 
Acid Hybridization: A Practical Approach (Hames et al. 9 eds. 1987). In addition, GPCR- 
B3 protein can be detected with the various immunoassay techniques described above. 
The test sample is typically compared to both a positive control (e.g., a sample expressing 
recombinant GPCR-B3) and a negative control. 
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The present invention also provides for kits for screening for modulators 
of GPCR-B3. Such kits can be prepared from readily available materials and reagents. 
For example, such kits can comprise any one or more of the following materials: GPCR- 
B3, reaction tubes, and instructions for testing GPCR-B3 activity. Optionally, the kit 
5 contains biologically active GPCR-B3. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit 
and the particular needs of the user. 

IX. Administration and pharmaceutical compositions 

10 Taste modulators can be administered directly to the mammalian subject 

for modulation of taste in vivo. Administration is by any of the routes normally used for 
introducing a modulator compound into ultimate contact with the tissue to be treated, e.g., 
the tongue or mouth. The taste modulators are administered in any suitable manner, 
optionally with pharmaceutically acceptable carriers. Suitable methods of administering 

1 5 such modulators are available and well known to those of skill in the art, and, although 
more than one route can be used to administer a particular composition, a particular route 
can often provide a more immediate and more effective reaction than another route. 

Pharmaceutically acceptable carriers are determined in part by the 
particular composition being administered, as well as by the particular method used to 

20 administer the composition. Accordingly, there is a wide variety of suitable formulations 
of pharmaceutical compositions of the present invention (see, e.g., Remington 's 
Pharmaceutical Sciences, 17 th ed. 1985)). 

The taste modulators, alone or in combination with other suitable 
components, can be made into aerosol formulations (i.e., they can be Nebulized") to be 

25 administered via inhalation. Aerosol formulations can be placed into pressurized 

acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like. 

Formulations suitable for administration include aqueous and non-aqueous 
solutions, isotonic sterile solutions, which can contain antioxidants, buffers, bacteriostats, 
and solutes that render the formulation isotonic, and aqueous and non-aqueous sterile 

30 suspensions that can include suspending agents, solubilizers, thickening agents, 

stabilizers, and preservatives. In the practice of this invention, compositions can be 
administered, for example, by orally, topically, intravenously, intraperitoneally, 
intravesically or intrathecally. Optionally, the compositions are administered orally or 
nasally. The formulations of compounds can be presented in unit-dose or multi-dose 

54 



WO 00/06592 



PCT/US99/17099 



sealed containers, such as ampules and vials. Solutions and suspensions can be prepared 
from sterile powders, granules, and tablets of the kind previously described. The 
modulators can also be administered as part a of prepared food or drug. 

The dose administered to a patient, in the context of the present invention 
5 should be sufficient to effect a beneficial response in the subject over time. The dose will 
be determined by the efficacy of the particular taste modulators employed and the 
condition of the subject, as well as the body weight or surface area of the area to be 
treated. The size of the dose also will be determined by the existence, nature, and extent 
of any adverse side-effects that accompany the administration of a particular compound 

1 0 or vector in a particular subject. 

In determining the effective amount of the modulator to be administered in 
a physician may evaluate circulating plasma levels of the modulator, modulator 
toxicities,, and the production of anti-modulator antibodies. In general, the dose 
equivalent of a modulator is from about 1 ng/kg to 10 mg/kg for a typical subject. 

1 5 For administration, taste modulators of the present invention can be 

administered at a rate determined by the LD-50 of the modulator, and the side-effects of 
the inhibitor at various concentrations, as applied to the mass and overall health of the 
subject. Administration can be accomplished via single or divided doses. 

20 All publications and patent applications cited in this specification are 

herein incorporated by reference as if each individual publication or patent application 
were specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be readily 

25 apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit 
or scope of the appended claims. 

EXAMPLES 

30 The following examples are provided by way of illustration only and not 

by way of limitation. Those of skill in the art will readily recognize a variety of 
noncritical parameters that could be changed or modified to yield essentially similar 
results. 

55 



WO 00/06592 



PCTYUS99/17099 



Example I: Cloning and expression of GPCR-B3 

Since taste transduction occurs in taste receptor cells found in taste buds of 
the tongue and palate epithelium, a full-length cDNA library was generated from rat taste 
papillae. This library was made by oligo-dT priming of poly- A+ RNA isolated from 
5 several hundreds rat circumvallate papillae using a directional 1ZAP vector (Stratagene 
Inc; Hoon & Ryba, J. Dent Res. 76:831-838 (1997)) following standard molecular 
biology procedures (see, e.g., Ausubel et al, Current Protocols in Molecular Biology 
(1995). A collection of single-cell and single taste-bud cDNA libraries was also 
generated from individually isolated taste receptor cells and taste buds from rat and 

10 mouse circumvallate, foliate and fungiform papillae according to the method of Dulac & 
Axel, Cell 83:195-206 (1995). Taste buds and single taste receptor cells were isolated by 
enzymatic digestion and micro-dissection of lingual epithelium from adult rats and mice. 
To maximize lysis efficiency in the taste bud preparations, the lysis buffer volume was 
increased 10 fold (Dulac & Axel, supra). 

1 5 GPCR-B3 was isolated from the 1ZAP circumvallate cDNA library by first 

generating a subtracted library enriched in sequences expressed in taste tissue. 
Construction and initial analysis of a taste-receptor cell subtracted cDNA library was as 
described by Hoon & Ryba, supra. Further enrichment of taste-specific transcripts was 
achieved by dot-blot screening of cDNA clones with non-taste cDNA probes. Non-taste 

20 probes included lingual epithelium tissue devoid of taste buds, muscle, liver and brain 
tissue. The individual hybridization probes were generated by preparing first strand 
cDNA and labeling it using random priming methods as described in Ausubel et ai, 
supra. Hybridization conditions and washes were 65°C, 2x SSC for hybridizations, and 
65°C, O.lx SSC for washing. 

25 All cDNAs that showed taste tissue enrichment in the differential screens 

with taste and non-taste tissue were picked for DNA sequencing analysis using standard 
dideoxy-termination methods and an automated ABI sequencing machine. DNA 
sequences were subjected to data analysis using a variety of homology and structure 
prediction programs (e.g. blast at http://www.ncbi.nlm.nih.gov/ Tmpred at 

30 http://dot.imgen.bcm.tmc.edu:9331/seq-search/struc-predict.html). Individual cDNA 
clones that encoded novel sequences, sequences with some similarity to known signaling 
components, sequences with multiple predicted transmembrane domains, or sequences 
with known motifs such as SH2, SH3, PDZ, etc (see for example pfam at 
http://pfam.wustl.edu/) were chosen as candidates for further analysis. 
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Candidate cDNAs were assayed for taste-cell expression by in situ 
hybridization to tissue sections of rat tongue. Tissue was obtained from adult rats. Fresh 
frozen sections (14 mm) were attached to silanized slides and prepared for in situ 
hybridization as described by Ryba & Tirindelli, Neuron 19:371-379 (1997). All in situ 
5 hybridizations were carried out at high stringency (5x SSC, 50 % formamide, 72 °C). For 
single-label detection, signals were developed using alkaline-phosphatase conjugated 
antibodies to digoxigenin and standard chromogenic substrates (Boehringer Mannheim) 
as described by Ryba & Tirindelli, supra. Partial DNA sequencing reactions were 
performed on -2000 subtracted and single-cell cDNA clones, and in situ hybridizations 
1 0 were carried out with -400 different candidate cDNAs. This screen identified a number 
of genes expressed in taste receptor cells including a single clone encoding a 3' fragment 
ofGPCR-B3. 

Full-length rat GPCR-B3 was isolated from the 1ZAP rat circumvallate 
cDNA library following standard plaque hybridization protocols (Ausubel et al, supra). 

15 Approximately 2.5 xl06 clones were plated at high density on LB plates (-100,000 
phage/plate) and replica filters were hybridized with a radiolabeled GPCR-B3 probe at 
high stringency (65°C, 2x SSC). Positive clones were picked, retested, purified and 
characterized by DNA restriction mapping and sequencing analysis. Several full-length 
GPCR-B3 clones were isolated and characterized (see SEQ ID NOS:4-6 and the amino 

20 acid sequences that they encode, SEQ ID NOS: 1-3). 

The mouse interspecies homolog of rat GPCR-B3 was isolated by 
screening a mouse genomic Bac and 1 library (Genome Systems) at low and moderate 
stringency (48°C, 7x SSC and 55°C, 5x SSC). The clones were characterized by 
restriction mapping and DNA sequencing. A mouse cDNA was isolated by performing 

25 RACE reactions (Marathon Kit, Clonetech) using first-strand cDNA prepared from RNA 
isolated from mouse circumvallate and foliate papillae. The human homolog of GPCR- 
B3 was isolated from a human testis library (Clonetech Inc.) following the observation 
that other sensory receptors such as olfactory and visual receptors are expressed in testis 
(Axel & Dulac, supra). See Figure 1 for a topological map of GPCR-B3, showing the 

30 extracellular domain, seven transmembrane domains, and an intracellular or C-terminal 
domain. 
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Example II: Western blot and in situ analysis 

To demonstrate specific expression of GPCR-B3 protein in taste cells, 
antibodies were generated against short peptides and GPCR-B3 fusion proteins. The 
peptides consisted of 18 amino acid residues from the N- or C-terminal end of the GPCR- 
B3 predicted protein (see, e.g., SEQ ID NO:l and 2). The fusion proteins consisted of 
GST-fusion polypeptides encompassing the entire N-terminal domain or the last 3 
predicted transmembrane segments plus the C-term region. Fusions were generated using 
standard molecular techniques (Harlow & Lane, Antibodies (1988)). Peptides were fused 
to carrier proteins, immunized into rabbits, and the serum affinity purified and assayed as 
described by Cassill et al, Proa Natl Acad. ScL USA 88:1 1067-1 1070 (1991)). 

Antibodies were tested for specificity by western-blot analysis of protein 
homogenates from circumvallate or fungiform papillae. The blots also contained liver 
and brain protein extracts as negative controls. For immunohistochemistry, frozen 
sections were prepared as described by Ryba & Tirindelli, supra for in situ 
hybridizations, except that blocking reactions used 10 % donkey immunoglobulin, 1 % 
bovine serum albumin, 0.3% Triton X-100. Sections were incubated in the appropriate 
dilution of anti-TRl (1:100) for 12-18 hrs., and detected using fluorescein-conjugated 
donkey anti-rabbit secondary antibodies (Jackson Immuno laboratory). Taste buds were 
counter-stained with the F-actin marker BODIPYRTR-X phallacidin (Molecular Probes). 
As a control for these studies, anti-NCAM antibodies were also used. Fluorescent images 
were obtained using a Leica TSC confocal microscope with an argon-krypton laser. Pre- 
treatment of the antibodies with the peptide immunogen abolished staining. See Figures 2 
and 3 for western blot and in situ analysis, respectively. 
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1 . An isolated nucleic acid encoding a sensory transduction G-protein 
coupled receptor, the receptor comprising greater than about 70% amino acid identity to 

5 an amino acid sequence of SEQ ID NO: 1 , SEQ ID NO:2, or SEQ ID NO:3. 

2. The isolated nucleic acid of claim 1 , wherein the nucleic acid 
encodes a receptor that specifically binds to polyclonal antibodies generated against SEQ 
ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. 

10 

3. The isolated nucleic acid of claim 1, wherein the nucleic acid 
encodes a receptor that has G-coupled protein receptor activity. 

4. The isolated nucleic acid of claim 1 , wherein the nucleic acid 

15 encodes a receptor comprising an amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, 
orSEQIDNO:3. 

5. The isolated nucleic acid sequence of claim 1, wherein the nucleic 
acid comprises a nucleotide sequence of SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. 

20 

6. The isolated nucleic acid of claim 1 , wherein the nucleic acid is 
from a human, a mouse, or a rat. 

7. The isolated nucleic acid of claim 1 , wherein the nucleic acid is 

25 amplified by primers that selectively hybridize under stringent hybridization conditions to 
the same sequence as degenerate primer sets encoding amino acid sequences selected 
from the group consisting of: 

IAWDWNGPKW (SEQ ID NO:7) and 

LPENYNEAKC (SEQ ID NO:8). 

30 

8. The isolated nucleic acid of claim 1 , wherein the nucleic acid 
encodes a receptor having a molecular weight of about between 92 kDa to about 102 kDa. 
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9. An isolated nucleic acid encoding a sensory transduction G-protein 
coupled receptor, wherein the nucleic acid specifically hybridizes under highly stringent 
conditions to a nucleic acid having the sequence of SEQ ID NO:4, SEQ ID NO:5, or SEQ 
IDNO:6. 

5 

1 0. An isolated nucleic acid encoding a sensory transduction G-protein 
coupled receptor, the receptor comprising greater than about 70% amino acid identity to a 
polypeptide having a sequence of SEQ ID NO: 1 , SEQ ID NO:2, or SEQ ID NO:3, 
wherein the nucleic acid selectively hybridizes under moderately stringent hybridization 

10 conditions to a nucleotide sequence of SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. 

11. An isolated nucleic acid encoding an extracellular domain of a 
sensory transduction G-protein coupled receptor, the extracellular domain having greater 
than about 70% amino acid sequence identity to the extracellular domain of SEQ ID 

15 NO:l. 

1 2. The isolated nucleic acid of claim 1 1 , wherein the nucleic acid 
encodes the extracellular domain linked to a nucleic acid encoding a heterologous 
polypeptide, forming a chimeric polypeptide. 

20 

1 3 . The isolated nucleic acid of claim 1 1 , wherein the nucleic acid 
encodes the extracellular domain of SEQ ID NO: 1 . 

14. An isolated nucleic acid encoding a transmembrane domain of a 
25 sensory transduction G-protein coupled receptor, the transmembrane domain comprising 

greater than about 70% amino acid sequence identity to the transmembrane domain of 
SEQIDNO:l. 

15. The isolated nucleic acid of claim 14, wherein the nucleic acid 
30 encodes the transmembrane domain linked to a nucleic acid encoding a heterologous 

polypeptide, forming a chimeric polypeptide. 



16. The isolated nucleic acid of claim 14, wherein the nucleic acid 
encodes the transmembrane domain of SEQ ID NO: 1 . 
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1 7. The isolated nucleic acid of claim 14, wherein the nucleic acid 
further encodes a cytoplasmic domain comprising greater than about 70% amino acid 
identity to the cytoplasmic domain of SEQ ID NO:l. 

5 

18. The isolated nucleic acid of claim 1 7, wherein the nucleic acid 
encodes the cytoplasmic domain of SEQ ID NO: 1 . 

19. An isolated sensory transduction G-protein coupled receptor, the 
10 receptor comprising greater than about 70% amino acid sequence identity to an amino 

acid sequence of SEQ ID NO: 1 , SEQ ID NO:2, or SEQ ID NO:3. 

20. The isolated receptor of claim 1 9, wherein the receptor specifically 
binds to polyclonal antibodies generated against SEQ ID NO: 1, SEQ ID NO:2, or SEQ ID 

15 NO:3. 

2 1 . The isolated receptor of claim 1 9, wherein the receptor has G- 
protein coupled receptor activity. 

20 22. The isolated receptor of claim 19, wherein the receptor has an 

amino acid sequence of SEQ ID NO: 1 , SEQ ID NO:2, or SEQ ID NO:3. 

23. The isolated receptor of claim 19, wherein the receptor is from a 
human, a rat, or a mouse. 

25 

24. An isolated polypeptide comprising an extracellular domain of a 
sensory transduction G-protein coupled receptor, the extracellular domain comprising 
greater than about 70% amino acid sequence identity to the extracellular domain of SEQ 
IDNO:l. 

30 

25. The isolated polypeptide of claim 24, wherein the polypeptide 
encodes the extracellular domain of SEQ ID NO: 1 . 
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26. The isolated polypeptide of claim 24, wherein the extracellular 
domain is covalently linked to a heterologous polypeptide, forming a chimeric 
polypeptide. 

5 27. An isolated polypeptide comprising a transmembrane domain of a 

sensory transduction G-protein coupled receptor, the transmembrane domain comprising 
greater than about 70% amino acid sequence identity to the transmembrane domain of 
SEQIDNO:l. 

10 28. The isolated polypeptide of claim 27, wherein the polypeptide 

encodes the transmembrane domain of SEQ ID NO:l. 

29. The isolated polypeptide of claim 27, further comprising a 
cytoplasmic domain comprising greater than about 70% amino acid identity to the 

15 cytoplasmic domain of SEQ ID NO: 1. 

30. The isolated polypeptide of claim 29, wherein the polypeptide 
encodes the cytoplasmic domain of SEQ ID NO: 1 . 

20 31. The isolated polypeptide of claim 27, wherein the transmembrane 

domain is covalently linked to a heterologous polypeptide, forming a chimeric 
polypeptide. 

32. The isolated polypeptide of claim 31, wherein the chimeric 
25 polypeptide has G-protein coupled receptor activity. 

33. An antibody that selectively binds to the receptor of claim 19. 

34. An expression vector comprising the nucleic acid of claim 1. 

30 

35. A host cell transfected with the vector of claim 34. 

36. A method for identifying a compound that modulates sensory 

signaling in sensory cells, the method comprising the steps of: 
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(i) contacting the compound with a polypeptide comprising an 
extracellular domain of a sensory transduction G-protein coupled receptor, the 
extracellular domain comprising greater than about 70% amino acid sequence identity to 
the extracellular domain of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3; and 
5 (ii) determining the functional effect of the compound upon the 

extracellular domain. 

37. The method of claim 36, wherein the polypeptide is a sensory 
transduction G-protein coupled receptor, the receptor comprising greater than about 70% 

1 0 amino acid identity to a polypeptide encoding SEQ ID NO: 1 , SEQ ID NO:2, or SEQ ID 
NO:3. 

38. The method of claim 37, wherein the polypeptide comprises an 
extracellular domain that is covalently linked to a heterologous polypeptide, forming a 

15 chimeric polypeptide. 

39. The method of claim 37 or 38, wherein the polypeptide has G- 
protein coupled receptor activity. 

20 40. The method of claim 36, wherein the extracellular domain is linked 

to a solid phase. 

41 . The method of claim 40, wherein the extracellular domain is 
covalently linked to a solid phase. 

25 

42. The method of claim 37 or 38, wherein functional effect is 
determined by measuring changes in intracellular cAMP, IP3, or Ca 2+ . 

43. The method of claim 36, wherein the functional effect is a chemical 

30 effect. 

44. The method of claim 36, wherein the functional effect is a physical 

effect. 
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45. The method of claim 36, wherein the functional effected is 
determined by measuring binding of the compound to the extracellular domain. 

46. The method of claim 36, wherein the polypeptide is recombinant. 

5 

47. The method of claim 36, wherein the polypeptide is from a rat, a 
mouse, or a human. 

48. The method of claim 37, wherein the polypeptide comprises an 
10 amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. 

49. The method of claim 37 or 38, wherein the polypeptide is 
expressed in a cell or cell membrane. 

15 50. The method of claim 49, wherein the cell is a eukaryotic cell. 

51. A method for identifying a compound that modulates sensory 
signaling in sensory cells, the method comprising the steps of: 

(i) contacting the compound with a polypeptide comprising an 
20 extracellular domain of a sensory transduction G-protein coupled receptor, the 

transmembrane domain comprising greater than about 70% amino acid sequence identity 
to the extracellular domain of SEQ ID NO:l, SEQ ED NO:2, or SEQ ID NO:3; and 

(ii) determining the functional effect of the compound upon the 
transmembrane domain. 

52. The method of claim 5 1 , wherein the polypeptide comprises an 
transmembrane domain that is covalently linked to a heterologous polypeptide, forming a 
chimeric polypeptide. 

30 53. The method of claim 52, wherein the chimeric polypeptide has G- 

protein coupled receptor activity. 



54. The method of claim 51, wherein the functional effect is 

determined by measuring changes in intracellular cAMP, IP3, or Ca 2+ . 
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5 5 . The method of claim 5 1 , wherein the functional effect is a chemical 



15 



30 



effect. 



5 56. The method of claim 5 1 , wherein the functional effect is a physical 

effect. 

57. The method of claim 5 1 , wherein the polypeptide is recombinant. 

10 58. The method of claim 51, wherein the polypeptide is from a rat, a 

mouse, or a human. 

59. The method of claim 5 1 or 52, wherein the polypeptide is 
expressed in a cell or cell membrane. 



60. The method of claim 59, wherein the cell is a eukaryotic cell. 



61. A method of making a sensory transduction G-protein coupled 
receptor, the method comprising the step of expressing the receptor from a recombinant 

20 expression vector comprising a nucleic acid encoding the receptor, wherein the amino 
acid sequence of the receptor comprises greater than about 70% amino acid identity to a 
polypeptide having a sequence of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. 

62. A method of making a recombinant cell comprising a sensory 
25 transduction G-protein coupled receptor, the method comprising the step of transducing 

the cell with an expression vector comprising a nucleic acid encoding the receptor, 
wherein the amino acid sequence of the receptor comprises greater than about 70% amino 
acid identity to a polypeptide having a sequence of SEQ ID NO:l, SEQ ID NO:2, or SEQ 
IDNO:3. 



63. A method of making an recombinant expression vector comprising 

a nucleic acid encoding a sensory transduction G-protein coupled receptor, the method 

comprising the step of ligating to an expression vector a nucleic acid encoding the 

receptor, wherein the amino acid sequence of the receptor comprises greater than about 
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70% amino acid identity to a polypeptide having a sequence of SEQ ID NO:l, SEQ ID 
NO:2,orSEQIDNO:3. 
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AMENDED CLAIMS 
[received by the International Bureau on 1 1 November 1 999 (11.11 .99); 
original claim 51 amended; remaining claims unchanged (1 page)] 

45. The method of claim 36, wherein the functional effected is 
determined by measuring binding of the compound to the extracellular domain. 

46. The method of claim 36, wherein the polypeptide is recombinant. 

47. The method of claim 36, wherein the polypeptide is from a rat, a 
mouse, or a human. 



48. The method of claim 37, wherein the polypeptide comprises an 
10 amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3. 

49. The method of claim 37 or 38, wherein the polypeptide is 
expressed in a cell or cell membrane. 

15 50. The method of claim 49, wherein the cell is a eukaryotic cell. 

51. A method for identifying a compound that modulates sensory 
signaling in sensory cells, the method comprising the steps of: 

(i) contacting the compound with a polypeptide comprising a 

20 transmembrane domain of a sensory transduction G-protein coupled receptor, the 

transmembrane domain comprising greater than about 70% amino acid sequence identity 
to the transmembrane domain of SEQ ID NO: 1, SEQ ID NO:2, or SEQ ID NO:3; and 

(ii) determining the functional effect of the compound upon the 
transmembrane domain. 

25 

52. The method of claim 5 1 , wherein the polypeptide comprises an 
transmembrane domain that is covalently linked to a heterologous polypeptide, forming a 
chimeric polypeptide. 

30 53. The method of claim 52, wherein the chimeric polypeptide has G- 

protein coupled receptor activity. 

54. The method of claim 5 1 , wherein the functional effect is 

determined by measuring changes in intracellular cAMP, IP3, or Ca 2+ . 
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104 >^ 
80 >s. ^ 

48 

33 s. _ 
28 ^ 

m=markers 
tb=taste buds 
nt-t=non-taste from tongue 
nt=non-taste tissue 
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SEQUENCE LISTING 
Rat GPCR-B3 amino acid sequence-SEO ID NQ:1 

MLFWAAHLLLSLQLVYCWAFSCQRTESSPGFSLPGDFLLAGLFSLHGDCLQVRHRPLVTS 
5 CDRPDSFNGHGYHLFQAMRFTVEEINNSSALLPNITLGYELYDVCSESANVYATLRVI^ 
QGPRHIEIQKDLRNHSSKWAFIGPDNTDHAVTTAALLGPFLMPLVSYEAS^ 
FPSFLRTVPSDRHQVEVMVQLLQSFGWVWISLIGSYGDYGQLGVQALEELAVPRGICVAF 
KDIVPFSARVGDPRMQSMMQHLAQARTTVVVVFSNRHLA^ 

DWAISTYITSVTGIQGIGTVLGVAVQQRQVPGLKEFEESYVRAVTAAPSACPEGSWCSTN 
1 0 QLCRECHTFTTRNMPTLGAFSMSAAYRVYEAVYAVAHGLHQLLGCTSE I CSRGPVYPWQL 
LQQ I YKVNFLLHENTVAFDDNGDTLGYYD 1 1 AWDWNGPEWTFE I IGSASLSPVHLDINKT 
KIQWHGKNNQVPVSVCTTDCLAGHHRVWGSHHCCFECVPCEAGTFLNMSELH 
EEWAPKESTTCFPRTVEFLAWHEPISLVLIAANTLLLLLLVGTAGLFAWHFHTPVVRSAG 
GRLCFLMLGSLVAGSCSFYSFFGEPTVPACLLRQPLFSLGFAIFLSCLTIRSFQLVIIFK 
15 FSTKVPTFYRTWAQNHGAGLFVIVSSTVHLLICLTWLVMWTPRPTREYQRFPHLVILECT 
EVNS VGFLLAFTHNI LLS I STFVCSYLGKELPENYNEAKCVTFSLLLNFVSWI AFFTMAS 
IYQGSYLPAVNVLAGLTTLSGGFSGYFLPKCYVILCRPELNNTEHFQASIQDYTRRCGTT 

Mouse GPCR-P3 jinrino acid sequence-SEQ ID NO:2 

20 MLFWAAHLLLSLQLAVAYCWAFSCQRTESSPGFSLPGDFLLAGLFSLHADCLQVRHRPLV 
TSCDRSDSFNGHGYHLFQAMRFTVEEINNSTALLPNITLGYELYDVCSESSNVYATLRVP 
AQQGTGHLEMQRDLRNHS S KWAL I GPDNTDHAVTTAALLS PFLMPLVS YEAS S VI LSGK 
RKFPSFLRTIPSDKYQVEVIVRLLQSFGWVWISLVGSYGDYGQLGVQALEELATPRGICV 
AFKDWPLSAQAGDPRMQRMMLRLARARTTVVWFSNRHLAGVFFRSWI^ 

25 SEDWAISTYITNVPGIQGIGTVLGVAIQQRQVPGLKEFEESYVQAVMGAPRTCPEGSWCG 
TNQLCRECHAFTTWNMPELGAFSMSAAYNVYEAVYAVAHGLHQLLGCTSGTCARGPVYPW 
QLLQQIYKVNFLLHKKTVAFDDKGDPLGYYDIIAWDWNGPEWTFEVIGSASLSPVHLDIN 
KTKIQWHGKNNQVPVSVCTRDCLEGHHRLVMGSHHCCFECMPCEAGTFLNTSELHTCQPC 
GTEEWAPEGSSACFSRTVEFLGWHEPISLVLLAANTLLLLLLIGTAGLFAWRLHTPVVRS 

30 AGGRLCFLMLGSLVAGSCSLYSFFGKPTVPACLLRQPLFSLGFAIFLSCLTIRSFQLVII 
FKFSTKVPTFYHTWAQNHGAGIFVIVSSTVHLFLCLTWLAMWTPRPTREYQRFPHLV 
CTEVNSVGFLVAFAHNILLSISTFVCSYLGKELPENYNEAKCVTFSLLLHFVSWIAFFTM 
SSIYQGSYLPAVNVLAGLATLSGGFSGYFLPKCYVILCRPELNNTEHFQASIQDYTRRCG 
TT 



WO 00/06592 



PCT/US99/17099 



Human GPCR-B3 amino acid seauence-SEO ID NO:3 

RSCSFNEHGYHLFQAMRLGVEEINNSTALLPNITLGYQLYDVCSDSANVYATLRVLSLPG 
QHHI ELQGDLLHYS PTVLAVIGPDSTNRAATTAALLS PFLVHI SYAAS SETLSVKRQYPS 
FLRTIPNDKYQVETMVLLLQKFGWTWISLVGSSDDYGQLGVQALENQALVRGICIAFKDI 
MPFSAQVGDERMQCLMRHLAQAGATWWFSSRQLARVFFESWLTNLTGKVWVASEAWA 
LSRHITGVPGIQRIGMVLGVAIQKRAVPGLKAFEEAYARADKEAPRPCHKGSWCSSNQLC 
RECQAFMT^HTMPKLKAFSMSSAYNAYRAVYAVAHGLHQLLGCASELCSRGRVYPWQLLEQ 
IHKVHFLLHKDTVAFNDNRDPLSSYNI IAWDWNGPKWTFTVLGSSTWSPVQLNINETKIQ 
WHGKNHQVPKSVCSSDCLEGHQRWTGFHHCCFECVPCGAGTFLNKSELYRCQPCGTEEW 
APEGSQTCFPRTVVFLALREHTSWVLLAANTLLLLLLLGTAGLFAWHLDTPVVRSAGGRL 
CFLMLGSLAAGSGSLYGFFGEPTRPACLLRQALFALGFTIFLSCLTVRSFQLI I IFKFST 
KVPTFYHAWVQNHGAGLFVMISSAAQLLICLTWLWWTPLPAREYQRFPHLVMLECTETN 
SLGFILAFLYNGLLSISAFACSYLGKDLPENYNEAKCVTFSLLFNFVSWIAFFTTASVYD 
GKYLPAANMMAGLS SLSSGFGGYFLPKCYVI LCRPDLNSTEHFQAS I QDYTRRCGST 

Rat GPCR-B3 nucleotide seguence-SEQ ID NO:4 

ATTCACATCAGAGCTGTGCTCAGCCATGCTGGGCAGAGGGACGACGGCTGGCCAGCATGC 

TCTTCTGGGCTGCTCACCTGCTGCTCAGCCTGCAGTTGGTCTACTGCTGGGCTTTCAGCT 

20 GCCAAAGGACAGAGTCCTCTCCAGGCTTCAGCCTTCCTGGGGACTTCCTCCTTGCAGGTC 

TGTTCTCCCTCCATGGTGACTGTCTGCAGGTGAGACACAGACCTCTGGTGACAAGTTGTG 

ACAGGCCCGACAGCTTCAACGGCCATGGCTACCACCTCTTCCAAGCCATGCGGTTCACTG 

TTGAGGAGATAAACAACTCCTCGGCCCTGCTTCCCAACATCACCCTGGGGTATGAGCTGT 

ACGACGTGTGCTCAGAATCTGCCAATGTGTATGCCACCCTGAGGGTGCTTGCCCTGCAAG 

25 GGCCCCGCCACATAGAGATACAGAAAGACCTTCGCAACCACTCCTCCAAGGTGGTGGCCT 

TCATCGGGCCTGACAACACTGACCACGCTGTCACTACCGCTGCCTTGCTGGGTCCTTTCC 

TGATGCCCCTGGTCAGCTATGAGGCAAGCAGCGTGGTACTCAGTGCCAAGCGCAAGTTCC 

CGTCTTTCCTTCGTACCGTCCCCAGTGACCGGCACCAGGTGGAGGTCATGGTGCAGCTGC 

TGCAGAGTTTTGGGTGGGTGTGGATCTCGCTCATTGGCAGCTACGGTGATTACGGGCAGC 

30 TGGGTGTGCAGGCGCTGGAGGAGCTGGCCGTGCCCCGGGGCATCTGCGTCGCCTTCAAGG 

ACATCGTGCCTTTCTCTGCCCGGGTGGGTGACCCGAGGATGCAGAGCATGATGCAGCATC 

TGGCTCAGGCCAGGACCACCGTGGTTGTGGTCTTCTCTAACCGGCACCTGGCTAGAGTGT 

TCTTCAGGTCCGTGGTGCTGGCCAACCTGACTGGCAAAGTGTGGGTCGCCTCAGAAGACT 

GGGCCATCTCCACGTACATCACCAGCGTGACTGGGATCCAAGGCATTGGGACGGTGCTCG 

2 
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GTGTGGCCGTCCAGCAGAGACAAGTCCCTGGGCTGAAGGAGTTTGAGGAGTCTTATGTCA 
GGGCTGTAACAGCTGCTCCCAGCGCTTGCCCGGAGGGGTCCTGGTGCAGCACTAACCAGC 
TGTGCCGGGAGTGCCACACGTTCACGACTCGTAACATGCCCACGCTTGGAGCCTTCTCCA 
TGAGTGCCGCCTACAGAGTGTATGAGGCTGTGTACGCTGTGGCCCACGGCCTCCACCAGC 
5 TCCTGGGATGTACTTCTGAGATCTGTTCCAGAGGCCCAGTCTACCCCTGGCAGCTTCTTC 
AGCAGATCTACAAGGTGAATTTTCTTCTACATGAGAATACTGTGGCATTTGATGACAACG 
GGGACACTCTAGGTTACTACGACATCATCGCCTGGGACTGGAATGGACCTGAATGGACCT 
TTGAGATCATTGGCTCTGCCTCACTGTCTCCAGTTCATCTGGACATAT^ATAAGACAAAAA 
TCCAGTGGCACGGGAAGAACAATCAGGTGCCTGTGTCAGTGTGTACCACGGACTGTCTGG 

1 0 CAGGGCACCACAGGGTGGTTGTGGGTTCCCACCACTGCTGCTTTGAGTGTGTGCCCTGCG 
AAGCTGGGACCTTTCTCAACATGAGTGAGCTTCACATCTGCCAGCCTTGTGGAACAGAAG 
AATGGGCACCCAAGGAGAGCACTACTTGCTTCCCACGCACGGTGGAGTTCTTGGCTTGGC 
ATGAACCCATCTCTTTGGTGCTAATAGCAGCTAACACGCTATTGCTGCTGCTGCTGGTTG 
GGACTGCTGGCCTGTTTGCCTGGCATTTTCACACACCTGTAGTGAGGTCAGCTGGGGGTA 

15 GGCTGTGCTTCCTCATGCTGGGTTCCCTGGTGGCCGGAAGTTGCAGCTTCTATAGCTTCT 
TCGGGGAGCCCACGGTGCCCGCGTGCTTGCTGCGTCAGCCCCTCTTTTCTCTCGGGTTTG 
CCATCTTCCTCTCCTGCCTGACAATCCGCTCCTTCCAACTGGTCATCATCTTCAAGTTTT 
CTACCAAGGTGCCCACATTCTACCGTACCTGGGCCCAAAACCATGGTGCAGGTCTATTCG 
TCATTGTCAGCTCCACGGTCCATTTGCTCATCTGTCTCACATGGCTTGTAATGTGGACCC 

20 CACGACCCACCAGGGAATACCAGCGCTTCCCCCATCTGGTGATTCTCGAGTGCACAGAGG 
TCAACTCTGTAGGCTTCCTGTTGGCTTTCACCCACAACATTCTCCTCTCCATCAGTACCT 
TCGTCTGCAGCTACCTGGGTAAGGAACTGCCAGAGAACTATAATGAAGCCAAATGTGTCA 
CCTTCAGCCTGCTCCTCAACTTCGTATCCTGGATCGCCTTCTTCACCATGGCCAGCATTT 
ACCAGGGCAGCTACCTGCCTGCGGTCAATGTGCTGGCAGGGCTGACCACACTGAGCGGCG 

25 GCTTCAGCGGTTACTTCCTCCCCAAGTGCTATGTGATTCTCTGCCGTCCAGAACTCAACA 
ATACAGAACACTTTCAGGCCTCCATCCAGGACTACACGAGGCGCTGCGGCACTACCTGAT 
CCACTGGAAAGGTGCAGACGGGAAGGAAGCCTCTCTTCTTGTGCTGAAGGTGGCGGGTCC 
AGTGGGGCCGAGAGCTTGAGGTGTCTGGGAGAGCTCCGGCACAGCTTACGATGTATAAGC 
ACGCGGAAGAATCCAGTGCAATAAAGACGGGAAGTGTGAAAAAAAAAAAAAAAAAAAAAA 

30 AAAAAAAAAAA 

Mouse GPCR-B3 nucleotide sequence-SEO ID NO:5 

TTTGGCCAGCATGCTTTTCTGGGCAGCTCACCTGCTGCTCAGCCTGCAGCTGGCCGTTGC 
TTACTGCTGGGCTTTCAGCTGCCAAAGGACAGAATCCTCTCCAGGTTTCAGCCTCCCTGG 
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GGACTTCCTCCTGGCAGGCCTGTTCTCCCTCCATGCTGACTGTCTGCAGGTGAGACACAG 
ACCTCTGGTGACAAGTTGTGACAGGTCTGACAGCTTCAACGGCCATGGCTATCACCTCTT 
CCAAGCCATGGGGTTCACCGTTGAGGAGATAAACAACTCCACAGCTCTGCTTCCCAACAT 
CACCCTGGGGTATGAACTGTATGACGTGTGCTCAGAGTCTTCCAATGTCTATGCCACCCT 
5 GAGGGTGCCCGCCCAGCAAGGGACAGGCCACCTAGAGATGCAGAGAGATCTTCGCAACCA 
CTCCTCCAAGGTGGTGGCACTCATTGGGCCTGATAACACTGACCACGCTGTCACCACTGC 
TGCCCTGCTGAGCCCTTTTCTGATGCCCCTGGTCAGCTATGAGGCGAGCAGCGTGATCCT 
CAGTGGGAAGCGCAAGTTCCCGTCCTTCTTGCGCACCATCCCCAGCGATAAGTACCAGGT 
GGAAGTCATAGTGCGGCTGCTGCAGAGCTTCGGCTGGGTCTGGATCTCGCTCGTTGGCAG 

10 CTATGGTGACTACGGGCAGCTGGGCGTACAGGCGCTGGAGGAGCTGGCCACTCCACGGGG 
GATCTGCGTCGCCTTCAAGGACGTGGTGCCTCTCTCCGCCCAGGCGGGTGACCCAAGGAT 
GCAGCGCATGATGCTGCGTCTGGCTCGAGCCAGGACCACCGTGGTCGTGGTCTTCTCTAA 
CCGGCACCTGGCTGGAGTGTTCTTCAGGTCTGTGGTGCTGGCCAACCTGACTGGCAAAGT 
GTGGATCGCCTCCGAAGACTGGGCCATCTCCACGTACATCACCAATGTGCCCGGGATCCA 

15 GGGCATTGGGACGGTGCTGGGGGTGGCCATCCAGCAGAGACAAGTCCCTGGCCTGAAGGA 
GTTTGAAGAGTCCTATGTCGAGGCAGTGATGGGTGCTCCCAGAACTTGCCCAGAGGGGTC 
CTGGTGCGGCACTAACCAGCTGTGCAGGGAGTGTCACGCTTTCACGACATGGAACATGCC 
CGAGCTTGGAGCCTTCTCCATGAGCGCTGCCTACAATGTGTATGAGGCTGTGTATGCTGT 
GGCCCACGGCCTCCACCAGCTCCTGGGATGTACCTCTGGGACCTGTGCCAGAGGCCCAGT 

20 CTACCCCTGGCAGCTTCTTCAGCAGATCTACAAGGTGAATTTCCTTCTACATAAGAAGAC 
TGTAGCATTCGATGACAAGGGGGACCCTCTAGGTTATTATGACATCATCGCCTGGGACTG 
GAATGGACCTGAATGGACCTTTGAGGTCATTGGTTCTGCCTCACTGTCTCCAGTTCATCT 
AGACATAAATAAGACAAAAATCCAGTGGCACGGGAAGAACAATCAGGTGCCTGTGTCAGT 
GTGTACCAGGGACTGTCTCGAAGGGCACCACAGGTTGGTCATGGGTTCCCACCACTGCTG 

25 CTTCGAGTGCATGCCCTGTGAAGCTGGGACATTTCTCAACACGAGTGAGCTTCACACCTG 
CCAGCCTTGTGGAACAGAAGAATGGGCCCCTGAGGGGAGCTCAGCCTGCTTCTCACGCAC 
CGTGGAGTTCTTGGGGTGGCATGAACCCATCTCTTTGGTGCTATTAGCAGCTAACACGCT 
ATTGCTGCTGCTGCTGATTGGGACTGCTGGCCTGTTTGCCTGGCGTCTTCACACGCCTGT 
TGTGAGGTCAGCTGGGGGTAGGCTGTGCTTCCTCATGCTGGGTTCCTTGGTAGCTGGGAG 

30 TTGCAGCCTCTACAGCTTCTTCGGGAAGCCCACGGTGCCCGCGTGCTTGCTGCGTCAGCC 
CCTCTTTTCTCTCGGGTTTGCCATTTTCCTCTCCTGTCTGACAATCCGCTCCTTCCAACT 
GGTCATCATCTTCAAGTTTTCTACCAAGGTACCCACATTCTACCACACTTGGGCCCAAAA 
CCATGGTGCCGGAATATTCGTCATTGTCAGCTCCACGGTCCATTTGTTCCTCTGTCTCAC 
GTGGCTTGCAATGTGGACCCCACGGCCCACCAGGGAGTACCAGCGCTTCCCCCATCTGGT 
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GATTCTTGAGTGCACAGAGGTCAACTCTGTGGGCTTCCTGGTGGCTTTCGCACACAACAT 
CCTCCTCTCCATCAGCACCTTTGTCTGCAGCTACCTGGGTAAGGAACTGCCGGAGAACTA 
TAACGAAGCCAAATGTGTCACCTTCAGCCTGCTCCTCCACTTCGTATCCTGGATCGCTTT 
CTTCACCATGTCCAGCATTTACCAGGGCAGCTACCTACCCGCGGTCAATGTGCTGGCAGG 
5 GCTGGCCACTCTGAGTGGCGGCTTCAGCGGCTATTTCCTCCCTAAATGCTACGTGATTCT 
CTGCCGTCCAGAACTCAACAACACAGAACACTTTCAGGCCTCCATCCAGGACTACACGAG 
GCGCTGCGGCACTACCTGAGGCGCTGCGGCACTACCTGAGGCGCTGCGGCACTACCTGA 

Human GPCR-B3 nucleotide sequence-SEO ID NO:6 

1 0 AGGTCTTGTAGCTTCAATGAGCATGGCTACCACCTCTTCCAGGCTATGCGGCTTGGGGTT 
GAGGAGATAAACAACTCCACGGCCCTGCTGCCCAACATCACCCTGGGGTACCAGCTGTAT 
GATGTGTGTTCTGACTCTGCCAATGTGTATGCCACGCTGAGAGTGCTCTCCCTGCCAGGG 
CAACACCACATAGAGCTCCAAGGAGACCTTCTCCACTATTCCCCTACGGTGCTGGCAGTG 
ATTGGGCCTGACAGCACCAACCGTGCTGCCACCACAGCCGCCCTGCTGAGCCCTTTCCTG 

15 GTGCATATTAGCTATGCGGCCAGCAGCGAGACGCTCAGCGTGAAGCGGCAGTATCCCTCT 
TTCCTGCGCACCATCCCCAATGACAAGTACCAGGTGGAGACCATGGTGCTGCTGCTGCAG 
AAGTTCGGGTGGACCTGGATCTCTCTGGTTGGCAGCAGTGACGACTATGGGCAGCTAGGG 
GTGCAGGCACTGGAGT^ACCAGGCCCTGGTCAGGGGCATCTGCATTGCTTTCAAGGACATC 
ATGCCCTTCTCTGCCCAGGTGGGCGATGAGAGGATGCAGTGCCTCATGCGCCACCTGGCC 

20 CAGGCCGGGGCCACCGTCGTGGTTGTTTTTTCCAGCCGGCAGTTGGCCAGGGTGTTTTTC 
GAGTCCGTGGTGCTGACCAACCTGACTGGCAAGGTGTGGGTCGCCTCAGAAGCCTGGGCC 
CTCTCCAGGCACATCACTGGGGTGCCCGGGATCCAGCGCATTGGGATGGTGCTGGGCGTG 
GCCATCCAGAAGAGGGCTGTCCCTGGCCTGAAGGCGTTTGAAGAAGCCTATGCCCGGGCA 
GACAAGGAGGCCCCTAGGCCTTGCACAAGGGCTCCTGGTGCAGCAGCAATCAGCTCTGCA 

25 GAGAATGCCAAGCTTTCATGGCACACACGATGCCCAAGCTCAAAGCCTTCTCCATGAGTT 
CTGCCTACAACGCATACCGGGCTGTGTATGCGGTGGCCCATGGCCTCCACCAGCTCCTGG 
GCTGTGCCTCTGAGCTCTGTTCCAGGGGCCGAGTCTACCCCTGGCAGCTTTTGGAGCAGA 
TC(^CAAGGTGCATTTCCTTCTA(^C^GGAC^CTGTGGCGTTTAATGA(^ACAGAGATC 
CCCTCAGTAGCTATAACATAATTGCCTGGGACTGGAATGGACCCAAGTGGACCTTCACGG 

30 TCCTCGGTTCCTCCACATGGTCTCCAGTTCAGCTAAACATAAATGAGACCAAAATCCAGT 
GGCACGGAAAGAACCACCAGGTGCCTAAGTCTGTGTGTTCCAGCGACTGTCTTGAAGGGC 
ACCAGCGAGTGGTTACGGGTTTCCATCACTGCTGCTTTGAGTGTGTGCCCTGTGGGGCTG 
GGACCTTCCTCAACAAGAGCGAGCTCTACAGATGCCAGCCTTGTGGAACAGAAGAGTGGG 
CACCTGAGGGAAGCCAGACCTGCTTCCCGCGCACTGTGGTGTTTTTGGCTTTGCGTGAGC 
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ACACCTCTTGGGTGCTGCTGGCAGCTAACACGCTGCTGCTGCTGCTGCTGCTTGGGACTG 
CTGGCCTGTTTGCCTGGCACCTAGACACCCCTGTGGTGAGGTCAGCAGGGGGCCGCCTGT 
GCTTTCTTATGCTGGGCTCCCTGGCAGCAGGTAGTGGCAGCCTCTATGGCTTCTTTGGGG 
AACCCACAAGGCCTGCGTGCTTGCTACGCCAGGCCCTCTTTGCCCTTGGTTTCACCATCT 
5 TCCTGTCCTGCCTGACAGTTCGCTCATTCCAACTAATCATCATCTTCAAGTTTTCCACCA 
AGGTACCTACATTCTACCACGCCTGGGTCCAAAACCACGGTGCTGGCCTGTTTGTGATGA 
TCAGCTCAGCGGCCCAGCTGCTTATCTGTCTAACTTGGCTGGTGGTGTGGACCCCACTGC 
CTGCTAGGGAATACCAGCGCTTCCCCCATCTGGTGATGCTTGAGTGCACAGAGACCAACT 
CCCTGGGCTTCATACTGGCCTTCCTCTACAATGGCCTCCTCTCCATCAGTGCCTTTGCCT 
10 GCAGCTACCTGGGTAAGGACTTGCCAGAGAACTACAACGAGGCCAAATGTGTCACCTTCA 
GCCTGCTCTTCAACTTCGTGTCCTGGATCGCCTTCTTCACCACGGCCAGCGTCTACGACG 
GCAAGTACCTGCCTGCGGCCAACATGATGGCTGGGCTGAGCAGCCTGAGCAGCGGCTTCG 
GTGGGTATTTTCTGCCTAAGTGCTACGTGATCCTCTGCCGCCCAGACCTCAACAGCACAG 
AGCACTTCCAGGCCTCCATTCAGGACTACACGAGGCGCTGCGGCTCCACCTGA 

15 
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